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Preface 


The origin of this book can be traced to NASA’s 
Applied Information Systems Research Program 
which was initiated to enhance space science 
productivity through the effective application of 
advanced computer science and technology. The 
program was developed because of the critical role 
that information systems and related technologies 
will play in meeting the scientific challenges of the 
1 990’s and beyond. The program also has relevance 
as we expand our research agenda toward under- 
standing the Earth as a system and eventually 
toward comprehending the origin of the universe. 

Unprecedented volumes of data will be generated in 
pursuing these objectives, which will in turn require 
analysis and interpretation that will lead to mean- 
ingful scientific insight. Providing a widely distrib- 
uted research community with the ability to access, 
manipulate, analyze, and visualize these complex, 
multidimensional data sets depends on a wide range 
of computer science and technology topics, and this 
book addresses many of the latest developments 
that service this critical need. Data storage and 
compression, database management, computational 


methods and algorithms, artificial intelligence, 
telecommunications, and high-resolution display 
are just a few of these topics. 

The individual contributions are based on papers 
presented at a special session of the spring 1 993 
meeting of the American Geophysical Union 
(AGU). The session, “Advanced Data Handling and 
Visualization Tools for Space and Atmospheric 
Sciences,” was a first for AGU in terms of its 
content and its use of on-line demonstrations of 
tools, technologies, and scientific insights. The 
need for interactivity, speed, user-friendliness, and 
extensibility is a unifying theme that the reader will 
find addressed throughout the papers. 

The past 3 years have seen a major breakthrough, 
with newly developed systems beginning to appear 
on the desktops of working scientists and data 
analysts. We are on the threshold of a new and 
exciting era in science productivity as these capa- 
bilities evolve and attain broader application and 
interoperability. 
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Expanding Missions of Space 
and Earth Sciences 


E.P. SZUSZCZEWICZ 

Laboratory for Atmospheric and Space Sciences 

Science Applications International Corporation 
McLean, VA 

The movement toward the solution of problems involving 
large-scale system science, the ever-increasing capa- 
bilities of three-dimensional, time-dependent numerical 
models, and the enhanced capabilities of “in situ" and 
remote sensing instruments bring a new era of scientific 
endeavor that requires an important change in our 
approach to mission planning and the task of data 
reduction and analysis. Visualization is at the heart of 
the requirements for a much-needed enhancement in 
scientific productivity as we face these new challenges. 
This article draws a perspective on the problem as it 
crosses discipline boundaries from solar physics to at- 
mospheric and ocean sciences. It also attempts to intro- 
duce visualization as a new approach to scientific dis- 
covery and a tool which expedites and improves our 
insight into physically complex problems. A set of simple 
illustrations demonstrates a number of visualization 
techniques and the discussion emphasizes the trial-and- 
error and search-and-discovermodes that are necessary 
for the techniques to reach their full potential. Further 
discussions also point to the importance of integrating 
data access, management, mathematical operations, and 
visualization into a single system. Some of the more 
recent developments in this area are reviewed. 


1. THE GRAND CHALLENGE OF 

LARGE-SCALE SYSTEM SCIENCE 

Space and Earth Science is shifting in emphasis 
from single exploratory investigations to concentra- 
tion on collaborative missions of large-scale system 
phenomena producing unprecedented volumes of 
data with unmatched temporal, spectral, and spatial 
resolution. The multi-parameter, multi-platform data 
bases will challenge our intellectual capabilities to 
absorb, synthesize, and effectively analyze the 
measurements, as well as impose new demands on 
software and hardware systems as we know them 
today. 

The large-scale numerical codes intended to 
model, interactively analyze, and ultimately predict 
the observed phenomena are of an equal challenge, 
requiring their implementation on massively parallel 
computers with performance characteristics ap- 
proaching the teraflop range. These codes, whether 
they address individual disciplines like solar, 
heliospheric, or atmospheric physics, or the coupling 
of solar-terrestrial processes that control our 
weather, have commonality in their complexities. 
They all involve three-dimensional time-dependent 
geometries, large variations in temporal and spatial 
scales, complex physical interactions, and most often 
non-linear dynamics. 

If we are to meet the challenge of the National 
Academy of Sciences to be able to predict the 
Earth’s environment in the context of its position in 
the solar system and in the context of global change, 
we must: (1) develop the systems which can 
efficiently gather and process the data, (2) access, 
compare and “tune” the appropriate models, and 
(3) integrate the results into a predictive capability. 
Here the integration of human intelligence with 
hardware and software systems, in addition to 
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appropriately designed visualization tools, plays a 
pivotal role if these ambitious goals are to be 
achieved and a predictive capability developed. 

In the domain of solar and heliospheric physics, 
we expect three-dimensional magnetohydrody- 
namic (MHD) models to be among the basic tools 
for a coherent hierarchical approach to the study of 
solar processes and their control of the global 
heliosphere from its coronal origins to its interac- 
tions with the Earth’s bow shock and the interstellar 
medium. The origins of the solar wind as well as 
coronal mass ejections (CMEs) must be modeled to 
help interpret ground-based and spacecraft data, 
such as that from the Solar Heliospheric Observa- 
tory (SOHO). Similarly, the interaction of solar 
wind streams, the propagation of CMEs, and the 
structure of the heliosphere out of the ecliptic must 
be modeled and compared with data from the 
Ulysses spacecraft. And the distant interaction of 
the solar wind with the local interstellar medium, 
leading to the formation of the solar wind termina- 
tion shock, the heliopause, and the heliospheric bow 
shock, must be modeled to help interpret data from 
the Pioneer and V oy ager spacecraft now reaching 
towards the edge of the solar system. 

The work of the solar and heliospheric commu- 
nities includes the requirements of magnetospheric 
physicists who will be more intent in looking out 
into the interplanetary medium and dynamic solar 
phenomenologies to understand the cause-effect 
relationships in their discipline area of geospace. 
Upper atmospheric and thermospheric physicists 
will continue to be more keenly aware of the need 
for an understanding of ionospheric coupling 
mechanisms and the controls from the magneto- 
sphere and the heliosphere above, and the strato- 
sphere below. In addition, scientists studying the 
Earth ’ s ionospheric-thermospheric-mesospheric 
system will be looking to Mars, Venus, and beyond 
for comparative studies of large-scale planetary 
environments that not only form an intrinsic ele- 
ment in solar-system exploration, but provide 
unique opportunities to determine the limits of 
Earth-based concepts and extend our fundamental 
understanding of the processes under study. 

Atmospheric and ocean scientists will be 
looking for improved global-scale measurements of 
sea state and surface winds to better understand air- 
sea interactions, heat transfer mechanisms, weather, 


and climate. And the hydrologic cycle will be an 
experimental and theoretical focus because of its 
important role in creating and modulating the 
various climatic zones on the Earth. The global 
circulation models will therefore require the inte- 
gration of large volumes of environmental data, 
including rain-rate and rain-distribution data from 
ground-based radars, rain gauges, and spaceborne 
sensors like those included in the Tropical Rainfall 
Measurement Mission (TRMM). 

The increased focus on large-scale system 
phenomena, the cross-disciplinary nature of many 
investigations, and projections of increased vol- 
umes of “in situ” and remote sensing data are all 
elements in the coming decade of Earth and Space 
Sciences. These projects range from new astro- 
nomical observatories such as the Space Telescope, 
planetary orbiters such as Galileo and Cassini, the 
multi-spacecraft International Solar Terrestrial 
Physics (ISTP) and TIMED missions, and the EOS 
mission to planet Earth. ISTP alone is expected to 
accumulate almost a gigabit of data per second. 

To be effective in their work scientists will 
require a data analysis environment with sophisti- 
cated interactive graphics tools based on advanced 
visualization techniques. Such techniques represent 
one of the most effective ways to “compress” 
complex data into a visually organized form which 
is optimized for analysis and interpretation. The 
scientists also require easy-to-use mathematical and 
image processing tools. In addition, they will need 
tools to assist in obtaining data from remote ar- 
chives as well as to access the results of empirical 
and numerical models to correlate with the data and 
assist in analysis and interpretation. To be most 
effective, these tools must be integrated into a 
single user-friendly system. 

2. VISUALIZATION: THE PATHWAY 

TO ENHANCED SCIENTIFIC 

PRODUCTIVITY 

Visualization is at the heart of our requirements 
for the much needed enhancement in scientific 
productivity as we face the grand challenge of large- 
scale system phenomena in space and Earth sci- 
ences. This is not to underplay the important roles of 
remote data access or easy-to-use mathematical 
tools, because they are the fundamental elements 
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with which the visualization process begins. 

In itself, scientific visualization is a rapidly 
evolving field. But the reality is that we have been 
applying visualization techniques in one form or 
another during our entire scientific careers. What 
has changed has been the magnitude of the prob- 
lems we are addressing, the sensitivity and dimen- 
sionality of our instruments, the volume of our data 
base, and the hardware, software, and computa- 
tional tools to interpret and assimilate the results. 
Visualization is not new. We are simply in a new 
era that requires a dramatic change in our approach 
and our attitudes toward data handling and analysis. 
This attitude must recognize that visualization is 
not a “pretty picture,” but rather a process of 
discovery that links the knowledge of the discipline 
with the knowledge of visualization tools. In the 
past, what may have been called visualization was 
really “data display,” that is, the graphical render- 
ing of the data when we knew what it meant or 
when we knew what we wanted it to say. On the 
other hand, the “visualization process,” as we have 
come to know it today, is the intersection of intel- 
lectual pursuit with a rendering of the data or model 
results which leads to discovery. The rendering 
involves the use of color, intensity, transparency, 
texture, animation and many other techniques 
which can convey a tremendous amount of informa- 
tion in a single image and in a short period of time. 
The overlay of images with varying degrees of 
transparency and texture, or the superposition of 
scalar and vector fields, or a sequence of images 
(i.e., animation) can yield more information than 
the sum of the parts. This allows our intellectual 
capabilities to parallel process the results as no 
machine can do, and couples the knowledge of 
our discipline with the knowledge of the visualiza- 
tion tools. 

Emphasizing its functional aspects, visualiza- 
tion can be thought of as having two major compo- 
nents: exploration and presentation [Keller and 
Keller, 1993]. In the exploration mode, we search 
the data or model results for new relationships or 
insights into complex phenomena. Often this means 
trial-and-error representations of the data and/or 
model results, requiring interactive adjustments of 
the data or the image and the direct participation of 
the scientist’ s knowledge of the discipline. It is in 
this mode that we emphasize visualization as a 


“process” rather than the graphical end-product of 
the “presentation” mode in which the arrangement 
and display of the data is for publication and the 
benefit of others. 

It must be emphasized that the exploitation of 
visualization technologies and the enhancement in 
scientific productivity require a somewhat non- 
traditional approach to handling and rendering the 
data and model results. Unfortunately, many 
scientists have been conditioned to regard data and 
model outputs as entities with inviolate properti es 
that ultimately dictate the use of standard displays 
and presentation formats for the data. Such rigid 
thinking precludes the exploitation of visualization 
techniques and undermines the exploration phase 
where trial and error in the coupling of knowledge 
of the discipline with the knowledge of the visual- 
ization tools is the pathway to discovery. 

3. ILLUSTRATIONS 

Recognizing that visualization is a process 
involving tools that include the use of color, trans- 
parency, texture, etc., along with rotation, pan, 
zoom, and animation [see e.g. Friedhoff and Benzon, 
1989; and Nielson et al., 1990], this section presents 
simple illustrations of several visualization tech- 
niques to highlight their utility in unfolding complex 
data, in developing quick insights into a problem, 
and in conveying large amounts of information in 
short periods of time. 

Color vs. isolines and multi-parameter over- 
lays. Color is one of the simplest tools for visualiz- 
ing data and it finds extended use in identifying 
morphologies in a scalar field. Isolines can do this 
as well, but color can identify morphologies when 
isolines fail. This is illustrated most simply with the 
blue-to-red color rendering of the scalar field of 
numbers ranging from 0 (blue) to 9 (red) in Fig- 
ure 1 . This might be a field of temperature measure- 
ments over a land mass or over the ocean. It might 
also represent plasma density distributions under 
disturbed conditions in the polar cap ionosphere. In 
any case, the visualization task must unfold unique 
morphological features (i.e., concentrations, rar- 
efactions, etc.). Isolines would fail in this applica- 
tion, since they would yield a tangled net of “spa- 
ghetti” that provides no simple organization of the 
data. On the other hand, the use of color immedi- 
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ately identifies a ‘‘hot spot” of 9s near the center of 
the image in Figure 1. 

Color has found great utility in a number of 
disciplines where there is the need for multi- 



Figure I. A color rendering of a sample two-dimensional array 
of numbers (0 [ blue/ through 9 [ red J) mimicking any scalar field 
of measurements where morphologies or topologies need to be 
determined. The color representation of the numerical field 
clearly reveals a concentration of higli-scale measurements (9s) 
near the center of the image, and a random distribution elsewhere 
[from A. Mankofsky. private communication (1993)]. 
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parameter overlays in endeavors focused on under- 
standing a phenomenon rather than a single data set. 
A typical application is in thermospheric physics, 
where two dimensional displays in latitude and 
longitude can, for example, depict atomic oxygen 
densities in color, molecular nitrogen densities as 
isolines, and winds with vectors (see Figure 2). Such 
an image is effective, but only at a given altitude. 
Understanding the full three-dimensional, time- 
dependent aspect of thermospheric circulation and 
its coupling to the global-scale ionosphere under 
magnetic storm conditions requires more advanced 
visualization tools than those demonstrated in 
Figure 2. Such tools include isosurfaces, streamlines, 
transparency, texturing, and animation, all of which 
are available in a number of commercial software 
packages (e.g., AVS from AVS, Inc., Data Explorer 
from IBM, etc.), and some of which have been 
implemented already in developmental applications 
[e.g., Foster et al., 1994; Twiddy et al., 1994; and 
Gekelman, 1994]. 

Isosurfaces and the need for a non-traditional 
approach to data presentation. The exploratory 



Figure 2. Illustration of color overlays to superimpose geophysical parameter sets in a study of the Earth 's thermosphere. Image shows 
the global distributions of winds ( white vectors), molecular nitrogen densities ( color-coded isolines), and atomic oxygen densities (full 
color-coded cut-plane) at 300 km as specified by the MSIS model [Hedin. 1991 ]. 
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Figure 3. Illustration of a non-traditional approach to rendering 
data and modeling results. Visualizing the Earth 's geomagnetic 
field with an isosurface application to the IGRF model 
immediately reveals the phenomenological domain called the 
South Atlantic Anomaly. See text for discussion. 

nature of the visualization process requires that 
scientists free themselves from rigid historical 
views on the display and presentation of their data. 
When this is done the opportunity presents itself for 
entirely new perspectives on the data and associated 
insights into the prevailing physical principles. This 
is illustrated in Figure 3 with a non-traditional 
approach to the representation of the Earth’ s 
geomagnetic field using the International Geomag- 
netic Reference Field model [IGRF. available from 
the National Geophysical Data Center, Boulder, 
CO], 

Typically, there are two standard representa- 
tions of the Earth's field. The first involves two- 
dimensional isocontours (i.e., isolines) of B-field 
values at a fixed altitude. The second is a fieldline 
representation that demonstrates the field’s dipolar 
nature. A non-traditional approach is to view the 
geomagnetic field with a three-dimensional 
isosurface of sequential B-field values. One such 
image for the B=0.5 and 0.265 Gauss isosurfaces is 
presented in Figure 3, which shows the B=0.5 
surface (yellow) confined to the polar regions. The 
B=0.265 surface is more global, with the image 
showing it furthest from the Earth near the north 
and south polar regions, closest to the Earth at mid- 
and equatorial latitudes, and intersecting the Earth 
in the South American/South Atlantic region. This 
intersection locates the well-known South Atlantic 


Anomaly where the field at the surface of the Earth 
has anomalously small values and where, as a 
consequence, energetic particles are known to 
penetrate to very low altitudes [Ratcliffe, 1972], 
This picture presents an intuitive insight into the 
causality, recognizing that the isosurface represents 
an approximation to the magnetic pressure ( B 2 /8 tc) 
and that energetic particles have higher probabili- 
ties of atmospheric entry in the South Atlantic 
“pressure well’’ that exists naturally in the Earth's 
B-field topology. One can only speculate on the 
impact that such a visualization tool may have had 
in early studies of the Earth’s field and its control of 
charged particle dynamics. 

The visualization process in data reduction and 
model-measurement comparisons. The products of 
visualization tools are not always representations of 
data and model runs. They often provide tests of 
model or algorithm integrity and time-saving views 
of an experiment scenario that facilitate determina- 
tion of the shortest path in the process of data 
reduction and analysis. These aspects of visualiza- 
tion are illustrated in an application to a rocket- 
borne chemical release experiment in the NASA/ 
CRRES program that involved "in situ’’ and 
ground-based systems and aspects of coupled 
phenomena along geomagnetic field lines. 

The first panel of Figure 4 shows the rocket 
trajectory along with an array of B-field vectors 
determined by the IGRF. The obvious discontinuity 
in the B-field representation revealed a problem in 
an interpolation module that may have gone unde- 
tected without this simple visual aid. 

The second panel of Figure 4 illustrates the 
application of visualization tools to help understand 
the actual configuration of the experiment. In the 
simplest description of the experiment scenario, a 
Ba/Li mixture in a sealed canister was separated 
from the rocket’s diagnostics payload and the 
vaporized gases were released on the upleg portion 
of the trajectory at an altitude near 280 km. The 
colored disk in the panel is a color-coded model 
depiction of the cloud’s ion density in the plane 
perpendicular to the cloud’s bulk velocity at a very 
early phase of the expansion process. The red line is 
the suborbital trajectory, and the long and short 
vectors, respectively, are the B-field and payload 
velocity at the point of the release. The cone-shaped 
object in the figure is the projection of a ground- 
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Figure 4. Illustration of visualization as an integral process of understanding an experiment configuration and optimizing the 
approach to data reduction and analysis. See text for details. 


based hf heater beam intended to intercept the 
chemical cloud at its ionospheric altitude, excite the 
electrons, and induce enhanced optical emissions 
and plasma instability processes. 

The initial process of understanding the actual 
configuration of the experiment involved the follow- 
ing questions: 

1 . What were the magnitude and directions of 
the cloud’s bulk velocity relative to the 
geomagnetic field? 

2. How well was the chemical release coupled 
along the B-field to the diagnostic payload? 

3. How effectively did the heater beam inter- 
cept the cloud? 

4. Was it possible to observe expanding cloud 
ions on the downleg portion of the 
trajectory? 

The first three questions were answered using 
rotation and zoom tools, with some of the image 
products presented in the middle and right-hand 
panels of Figure 4. The last question was answered 
with yet another rotation that projected an image 
perpendicular to the plane of the trajectory, with the 
downleg portion of the trajectory in the foreground. 
That image, presented in the right-hand panel of 
Figure 4, shows that the B-field at the release site 
did not intersect the downleg trajectory. A simple 
hand-scaling of altitude and range showed that the 


fluxtube passing through the release missed the 
downleg trajectory by 13 km, a sufficiently great 
distance to minimize interests in searching the data 
for possible late-time ion signatures of the expansion 
process on the downleg trajectory. 

The entire exercise of answering the questions 
listed above required less than five minutes. Without 
the visualization tools, the answers would have 
required the development of a number of mathemati- 
cal algorithms and a sequence of stack plots. That 
exercise could have taken 3 to 5 days, depending on 
the proficiency of the programmer. 

This illustration is one of many that could have 
been presented to highlight visualization as a time- 
saving and insightful element in the process of data 
reduction and analysis. This is especially true in 
situations that require co-registration of spacebome 
“in situ” and remote sensing techniques supported 
by ground-based diagnostics. For example, the 
NASA TIMED mission will rely heavily on 
spacebome and ground-based remote sensing 
techniques in its attempt to understand the coupling 
of the ionospheric, thermospheric and mesospheric 
domains and the influence of magnetospheric 
processes. The three-dimensional mapping of gravity 
wave influences, dynamic changes in neutral compo- 
sition, field-line integrated conductivities, and 
penetrating electric field events are among the many 
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issues that will need to be addressed. This will 
require considerable care in space and time correla- 
tions along with co-registration of instrument fields- 
of-view with coupled regions of the mesosphere, the 
lower thermosphere, and the D, E, Fj, and F 2 regions 
of the ionosphere. 

On the rendering of three-dimensional processes 
with isosurfaces, cut-planes, and transparencies. 

The problem of co-registration and space-time 
correlations among measurement platforms and 
instrument types can be particularly acute in investi- 
gations of the solar wind-magnetosphere-ionosphere 
(SW-M-I) system. Such investigations involve an 
incredible volume of space with prevailing phenom- 
ena that span the spatial spectrum from meters to 
several Earth radii. The coupling mechanisms within 
the SW-M-I system are at the center of the NASA 
ISTP and the NSF GEM programs, with a focus on 
the flow of energy and momentum into and through 
the system. The difficulties include the complexities 
of the interacting phenomena, the large set of 
controlling parameters, and the coupled domains of 
varying dimensions. 

The utility of several visualization tools in the 
study of the SW-M-I is illustrated here through 
applications to the MHD model of J. Lyon [Fedder 
and Lyon, 1987] in Figure 5. This figure, the product 
of the SAYS data acquisition and visualization 
system [Szuszczewicz, et al., 1994], has been 
created to address a Gedanken Experiment that 
attempts to determine the flow of momentum into 
specified regions of space under steady-state and 
dynamic environments. The upper panel presents the 
following elements: (1) the inner boundary of the 
computational domain at 3.5 Rp in red, (2) open and 
closed field lines in white, and (3) a pair of transpar- 
ent isosurfaces (i.e., isobars of constant plasma 
pressure) with a color scale that represents the 
magnetic pressure term B 2 /8 tc everywhere on the 
isobaric surface. (Transparency is used to show the 
B-field structure within the isobaric surface). 

The lower panel is a two-angle rotation (i.e., 
pitch and roll) of the top image with the 
transparency of the isobaric surfaces eliminated. The 
cut-plane, which is perpendicular to the plane of the 
B-field lines, presents: (1) bulk plasma velocity 
vectors in white, (2) the plasma density in a slightly 
transparent color code, and (3) the “spider web” 
computational grid of the MHD model which 


provides varying degrees of resolution in the 
computational domain. This image sets the stage for 
an improved understanding of the topologies and 
morphologies in the overall system and for the 
calculation of momentum and energy flows into and 
out of critical domains. In this case the Gedanken 
Experiment is concentrated on the isobaric surfaces 
whose topology we see is controlled by the lowest- 
latitude open field lines in the nightside hemisphere. 

These are just two of many possible images that 
can be at the researcher’s disposal. They clearly 
display the morphological relationships among the 
computational grid, the field configuration, the 
kinetic and magnetic pressures, the plasma density, 
and the bulk flow velocities — all elements critical to 
the calculation of energy and mass transfer. Without 
such images, insight and understanding of the 
problem would be impaired. 

4. THE NEED FOR AN INTEGRATED 

SYSTEM 

Recognizing that visualization is a process that 
enhances insight into the “reality” of physically 
complex problems is just the beginning for enhanced 
scientific productivity in the expanding missions of 
space and Earth sciences. Visualization is indeed a 
condition without which we cannot effectively 
address the grand challenges of the coming decade. 
Scientific productivity demands an interactive data 
analysis environment with advanced visualization 
tools, because such an approach represents one of 
the most effective ways to compress complex data 
into a visually-organized form that is optimized for 
analysis and interpretation. 

While it is generally agreed that easy-to-use 
visualization and data handling tools are vital to 
efficient and insightful progress in science, the 
applications development environment has been 
characterized as fragmented, with little overall 
direction or coordination [Botts, 1992], It has been 
argued that the primary bottleneck is the lack of 
adequate software which allows the scientist to take 
advantage of today’s computing power and interac- 
tively visualize and analyze his/her data within the 
complex computing environment. The end result is 
that the largest fraction of the scientific community 
is not using the visualization capabilities available 
today because (according to Botts): (1) the tools are 
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Figure 5. Illustration of imaging and visualization perspectives in addressing magnetospheric problems. Based on the MHD model 
of Fedder and Lyon ( 1987), this figure includes representations of the computational grid along with plasma and magnetic field 
pressure, bulk velocities, plasma densities, and open/closed field lines. See text for complete discussion. 
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not extensible or are too inflexible, (2) it is too 
difficult to input data, (3) the tools do not adequately 
link visualization and analysis, and (4) the tools do 
not meet scientific expectations. It is also agreed 
throughout the scientific community and effectively 
argued by Botts that the main general areas needing 
consideration include: 

1. integration of visualization with data man- 
agement and analysis; 

2. a framework of flexibility without complex- 
ity; and 

3. the development of an architecture focused 
on the scientist as the end user. 

This requires that the tools meet the actual needs 
of the scientist, be simple and intuitive to use, and be 
logical to the scientist rather than to a computer 
specialist. 

Many of these criticisms are currently being 
addressed within NASA’s Advanced Information 
Systems Research Program in the Office of Space 
Science, with progress having been reported recently 
in a special session of the American Geophysical 
Union [EOS 74, No. 16, April 20, 1993/Supplemen- 
tal] and its published proceedings [Szuszczewicz and 
Bredekamp, 1994; this volume]. As an illustration, 
the works of Berchem et al. (1994), Jacobsen and 
Berkin (1994), Szuszczewicz et al. (1994), Peredo et 
al. (1994), Russell (1994), and Searight et al. (1994) 
are developing integrated approaches to the access, 
display, and analysis of multivariate multidisci- 
plinary data sets. In a number of cases (e.g., 
Berchem et al., Szuszczewicz et al., and Peredo et 
al.), the integrated approach includes multiformat 
data readers, access to large-scale numerical 
models, and the superposition of satellite orbit 
tracks for model-measurement comparisons. The 
efforts of Jacobsen and Berkin ( 1 994) and 
Szuszczewicz et al. (1994) emphasize ease, func- 
tionality, and extensibility with simple push-button 
interfaces to data acquisition, mathematical pro- 
cessing, and visualization tools in an interactive 
environment that addresses a broad spectrum of 
space and Earth sciences. 

For these systems to reach their full potential 
there is the intrinsic need for scientists to open their 
minds to the broad spectrum of capabilities imbed- 
ded in the visualization tools that are being made 
available. As discussed earlier, many scientists have 
been conditioned to regard data and model outputs 


as entities with inviolate properties that dictate the 
use of standard displays. This attitude must be 
abandoned for the full potential of visualization to 
be realized. It is a rapidly developing field, that 
holds great promise. But like any resource it must 
be effectively mined. 
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Advanced information systems capabilities are essen- 
tial to conducting NASA 's scientific research mission. 
Access to these capabilities is no longer a luxury for a 
select few within the science community, but rather an 
absolute necessity for carrying out scientific investiga- 
tions. The dependence on high performance computing 
and networking, as well as ready and expedient access 
to science data, metadata, and analysis tools is the 
fundamental underpinning for the entire research en- 
deavor. A t the same time, advances in the whole range 
of information technologies continues on an almost 
explosive growth path, reaching beyond the research 
community to affect the population as a whole. Capi- 
talizing on and exploiting these advances are critical 
to the continued success of space science investiga- 
tions. NASA must remain abreast of developments in 
the field and strike an appropriate balance between 
being a smart buyer and a direct investor in the tech- 
nology which serves its unique requirements. Another 
key theme deals with the need for the space and com- 
puter science communities to collaborate as partners 
to more fully realize the potential of information tech- 
nology in the space science research environment. 


1. INTRODUCTION 

The 90s and beyond promise to be a very 
exciting and challenging era for conducting science 
from space. During this era we will be afforded the 
opportunity to observe the universe in all wave- 
lengths of the electromagnetic spectrum, and 
compare those observations with theoretical models 
and simulations of the origin and evolution of the 
universe. We will continue to explore all the bodies 
in our own solar system while also seeking the 
existence of others. The generation of solar energy, 
its transport, and its interaction with the terrestrial 
environment will be measured. Sustained experi- 
ments in the microgravity environment of space 
will be conducted. And, finally, we will study the 
Earth as a system while determining the extent, 
causes, and consequences of global climate change. 

We are challenged to carry out this grand, 
scientific endeavor within tight budget constraints, 
and continue to fulfill our responsibility to contrib- 
ute to America’s overall economic vitality. Fur- 
thermore, we are challenged to strengthen the 
relevance of our science mission with a more direct 
connection to the well-being of all Americans. It is 
imperative to broaden our outreach efforts to share 
the excitement of scientific discovery with the 
general public and to infuse the products of in- 
creased knowledge and understanding into educa- 
tional curricula as a true legacy to the future. 

Advanced data and computing systems and the 
full suite of related information technologies are 
critical, if not enabling, in achieving our space 
science objectives. High performance computing 
and networking, along with sophisticated software 
tools and techniques, to manipulate, analyze, and 
visualize complex data sets, are an integral part of 
the research environment for extending scientific 
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understanding and insight. NASA must remain 
abreast of this rapidly evolving area of technology 
to exploit advances and ensure a close coupling to 
the science research environment. NASA’s ap- 
proach to ensuring this coupling is based on three 
principal elements: 

The provision for state-of-the-art informa- 
tion systems services with reliable and 
expedient network access to data, easily 
transcended hierarchy of computing capa- 
bilities. and interactive analysis tools. 

The application of new technology to 
support growing demands and enable new 
research methods. 

The involvement of the science community 
from the onset to foster partnerships among 
computer scientists, technologists, and 
space scientists. 


We will now go into detail about some of the 
challenges associated with the current and future 
environment, the opportunities afforded through 
information technology, and examples of how the 
above strategy is being pursued. 

Figure 1 serves as a frame of reference for this 
discussion by providing an overview of functions 
and services associated with science information 
systems. 

2. TRENDS AND CHALLENGES 

Trends for future missions point toward more 
complex instruments with sharply increased data 
rates. In contrast with earlier single purpose 
investigations, involving observations from one 
instrument, future investigations will be much more 
interrelated in terms of dependence on multiple 
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Figure 1. Functional overview of the science information systems environment. 
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sources of data across missions, instruments, and 
ground sources. Complexity of observations will 
also increase with investigations requiring coordi- 
nated and often simultaneous observations in 
multiple wavelengths with high spatial, temporal, 
and spectral resolution. 

New patterns and research styles are anticipated 
for analyzing data and producing scientific findings. 
The value of the data assets acquired from space 
extends far beyond the life of the mission itself with 
data used in scientific studies not originally envi- 
sioned and dynamically defined as the heuristic 
process evolves. Collaborations are increasingly 
international in scope. Broad, scientific questions 
will be multidisciplinary in nature and involve 
widely dispersed investigator teams. The teams will 
require the combination and analysis of data from 
multiple sources. The need for comprehensive data 
archiving and distribution systems will be increas- 
ingly important. It will transcend individual project 
and science discipline boundaries. 

The principal product of the NASA science 
program is increased knowledge and understanding, 
which are frequently represented in the form of 
models, which simulate and predict the phenomena 
and/or physical processes in question. The ultimate 
test of that understanding is the closure between the 
theory and observation, or how the process as 
modeled compares with observed events. Therein 
lies another set of challenges to the information 
systems environment. In striving toward the goal to 
represent and predict physical processes on a broad 
range of spatial and temporal scales, there are 
several factors that drive increases in computational 
intensity and complexity: 

Resolution — Refining grid sizes to more 
accurately account for fine-scale effects, 
etc. 

Sophistication — Allowing for better physics 
with more realistic descriptions replacing 
simplifying assumptions, with improved 
algorithms and techniques, with better 
parameterizations, etc. 

Duration — Conducting many simulations to 
carry on true scientific experiments, as 
opposed to anecdotal studies. 

Comprehension — Visualizing and interacting 
with results. The ability to view the event 
from the inside and from all aspects and 


perspectives. Just as importantly, the 
ability to convey the results so that col- 
leagues and laymen alike can grasp the 
meaning. 

The combined effect of the trends and require- 
ments in multidisciplinary space science research is 
virtually unique in its demands for computing power, 
data handling capability, and scientific visualization. 

3. OPPORTUNITIES 

Information technology is one of the most 
rapidly advancing technology areas, with profound 
effect on, not only research and engineering, but 
also on almost every household. These technolo- 
gies are part of the information revolution which is 
stimulating economic growth and improving the 
quality of life. The National Information Infra- 
structure or “information highway” has become a 
part of the universal parlance and is increasingly 
evident in the popular news media and commercial 
offerings. 

Trends in technical capabilities in this area are 
dramatic and far-reaching. Today’s workstation is 
yesterday’s supercomputer, which was only avail- 
able from a central computing complex. Wide-area 
networking has revolutionized the way science is 
done by putting computing resources, data re- 
sources, and collaborators into a virtual laboratory 
setting. Similarly, very high volume data storage 
and management capabilities, along with affordable 
multi-media presentation options, are available to 
virtually every researcher, if not every home. 

Every aspect of NASA’s science activity profits 
from continuing advances in information technol- 
ogy, ranging from operations aspects, through 
science data product generation, to analysis and 
interpretation, comparison with theory, and presen- 
tation and defense of results. In the area of on- 
board processing for instance, microelectronic 
miniaturization and packaging have opened a wide 
range of possibilities. Advanced flight computers 
with greatly enhanced processor speed, memory 
capacity, and reconfiguration alternatives wil 1 allow 
more sophisticated on-board processing for autono- 
mous operations, intelligent image registration and 
classification, compression, selective downlink, etc. 
This technology now makes feasible flight system 
vs. ground system trade-offs to influence design 
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decisions with high potential for reducing burden- 
some operations costs, which is a prime target area 
for creating funding flexibility for future flight 

opportunities. 

In addition, continued improvements in auto- 
mation, expert systems, and telepresence methods 
will increase investigator interaction and remote 
control. These capabilities will extend to school 
children and the general public so they can share 
the experience of being there, whether to fly by a 
nearby planet, to get the daily weather report from 
Mars, or to drive a rover. 

There is not a more striking example of where 
the integrated effect of advances in information 
technology combine to benefit the science process 
than in the area of visualization. Advanced com- 
puting engines to drive real-time rendering for 
three-dimensional animations; high resolution 
image processing and display devices; the ability to 
locate, retrieve, fuse, and display widely distributed 
data sets; and advanced algorithms and interactive 
analysis techniques have unlimited potential for 
unlocking the scientific puzzles held captive within 
the massive amount of data. This applies to both 
observational data and that generated as output by 
highly sophisticated models and analysis systems. 

It is important to note that these visualizations are 
more than pretty pictures. They are keys to the 
discovery process leading to insight and under- 
standing and have the ability to convey that insight 
to others. Prospects in the area of virtual reality are 
also exciting with interactive exploration and 
experience, with the viewer embedded within the 
visualizations as an actual participant. 

Several principles will guide the evolution of 
information systems support for the science re- 
search community. A portion of direct science 
program funding will be dedicated to assure reli- 
able and robust data, computing, and communica- 
tions infrastructure as a vital supporting element for 
the science mission. This will extend and build on 
today ’ s NASA Science Internet to connect NASA 
assets consisting of computing resources, data 
centers, analysis tools, and science users into a 
worldwide network of collaborators, resources, and 
services. 

In order to effectively apply advances in 
technology, a coherent and organized process needs 
to be employed. This process must identify, assess. 
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select, evaluate, demonstrate, and infuse developing 
technologies. Furthermore, the science community, 
as recipient and beneficiary, must be involved in the 
process from the onset. The assurance of a close 
coupling between information technology and 
science needs is predicated on team efforts involv- 
ing technologists and system providers, coupled 
with a strong science application wrap-around. The 
Applied Information Systems Research component 
of the Science Information Systems Program was 
established on that basis. Figure 2 illustrates some 
examples of complementary research and tool 
developments sponsored under the auspices of that 
program and applied to actual science investiga- 
tions. They are also representative of the underly- 
ing motivation for organizing the special session at 
the meeting of the American Geophysical Union, 
which in turn led to the publication of this book as a 
collective set of contributions to that session. 

Another means for exploiting the technology 
curve for science benefit is through coordination 
and participation in related efforts within NASA, 
other federal agencies, industry, and academia. 

This takes the form of collaborative testbeds and 
technology evaluations, as well as dual-use partner- 
ships with industry to infuse new technology into 
the science environment and to encourage its 
subsequent transfer for broadened commercial 
application. 

An excellent example of an effective push and 
pull partnership between NASA science and 
technology programs is the High Performance 
Computing and Communications (HPCC) Program. 
This is a major national initiative, involving ten 
Federal agencies, to extend U.S. leadership in high 
performance computing technologies and accelerate 
the application of these technologies in the Federal 
government and throughout the economy. It is also 
intended to provide critical technology and applica- 
tion components for the new National Information 
Infrastructure. NAS A’ s participation in this pro- 
gram is organized around two principal Grand 
Challenge applications, computational aerosciences 
and Earth and space science. The grand challenge 
in Earth and space science is to demonstrate the 
potential afforded by teraflop system performance 
to further the understanding and ability to predict 
the dynamic interaction of physical, chemical, and 
biological processes affecting the solar-terrestrial 



Chapter I: Overviews 


17 



Figure 2. Illustrations using Office of Space Science-sponsored complementary research and tool developments. Clockwise, from top left: Electric field data from the Plasma 
Wave Spectrometer instrument onboard the Galileo spacecraft during Earth encounter, using the LinkWinds graphical spreadsheet system; Adjustment of temperature in 
an unstable atmospheric column by cumulus convection, as displayed using Gridded Analysis and Display System; three-dimensional rendering of the surface of Venus 
derived from Magellan radar imaging data (DeJong, 1994); and ionospheric-thermospheric system in support of the NASA CRRES mission, using the Space and Atmospheric 
Visualization Science system. 
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8 Grand Challenge 
Principal Investigator 
Teams 

selected through the 
NASA Research Announcement 
(NR A) Process 


Cosmology and 
Accretion 
Astrophysics 
Wojciech Zurek 
Los Alamos 


Large Scale Structure 
and Galaxy Formation 
George Lake 
University of Washington 


Climate Models 
Mac Suarez 
NASA Goddard 


Convective Turbulence 
and Mixing in 
Astrophysics 
Robert Rosner 
University of Chicago 


Atmosphere/Ocean 
Dynamics and 
Tracers Chemistry 
Roberto Mechoso 
UCLA 


Knowledge Discovery in 
Geophysical Databases 
Richard Muntz 
UCLA 


Solar Activity and 
Heliospheric Dynamics 
John Gardner 
NRL 


Four-Dimensional Data Assimilation 
Richard Rood 
NASA Goddard 


Figure 3. Earth and Space Science Grand Challenge Applications selected through the NASA Research Announcement process 
for the NASA High Performance Computing and Communications Program. 


environment and the universe. Twenty-nine 
investigations were selected for participation in the 
program through an open solicitation inviting 
proposals in two categories. Eight Principal Inves- 
tigator teams of space and computer scientists were 
selected to adapt computer intensive models and 
simulations to run on experimental testbeds of 
massively parallel computing systems. These 
investigations are illustrated in Figure 3. There are 
also 2 1 smaller scale guest computational investiga- 


tions focused on specific algorithms and computa- 
tional methods for paral lei computing architectures. 

4. SUMMARY 

Information systems and related technologies 
are essential to the furtherance of our space science 
research objectives. Therefore, it is critical to 
remain abreast of the rapidly advancing technology 
in order to be an early and innovative applier. It is also 
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important to maintain cognizance in the information 
technology arena in order to influence future technol- 
ogy developments as driven by special and / or unique 
needs relevant to space science. 

The key to coupling the demanding space science 
requirements for computing power, data handling 
capability, and scientific visualization with the oppor- 
tunities afforded through information technology is a 
strong strategic partnership involving the space science 
and computer science and technology communities. 
These multidiscipline collaborations build on mutual 
respect for the participating professions and their co- 
dependence for success. 

Information technology is truly an explosive 
field. It is advancing at such a rate that it acceler- 
ates the actual realization of our potential future far 
more quickly than the time horizon used for the 
projection. And, by tradition, the science research 
community has pushed the envelope in exploiting 
advances. The combined effect offers an extremely 
exciting realm of possibilities. 

The remaining chapters in this book provide an 
initial set of successful collaborations to enhance 
the scientific research environment, and a glimpse 
into the prospects for more to come. 
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T.A. POTEMRA, B.J. ANDERSON 

The Johns Hopkins University Applied Physics 

Laboratory , Laurel, MD 

The Johns Hopkins University Applied Physics Labo- 
ratory (JHU/APL) Magnetic Field Experiment Data 
Analysis System (MFEDAS) has been developed to 
process and analyze satellite magnetic field experi- 
ment data from the TRIAD, MAGSAT, AMPTE/CCE, 
Viking, Polar BEAR, DMSP, HILAT, UARSandFreja 
satellites. The MFEDAS provides extensive data man- 
agement andanalysis capabilities. The system is based 
on standard data structures and a standard user inter- 
face. The MFEDAS has twomajor elements: Ijasetof 
satellite unique telemetry processing programs for 
uniform and rapid conversion of the raw data to a 
standard format and 2) the program Magplot which 
has file handling, data analysis and data display sec- 
tions. This system is an example of software reuse, 
allowing new data sets and software extensions to be 
added in a cost effective and timely manner. Future 
additions to the system will include the addition of 
standard format file import routines, modification of 
the display routines to use a commercial graphics 
package based on X- Window protocols and a generic 
utility for telemetry data access and conversion. 


1. INTRODUCTION 

The Johns Hopkins University Applied Physics 
Laboratory (JHU/APL) Magnetic Field Experiment 
Data Analysis System (MFEDAS) was created to 
provide magnetic field data access and analysis 
capabilities for satellite magnetic field experiment 
data from TRIAD, MAGSAT, AMPTE/CCE, 
Viking, Polar BEAR, DMSP, HILAT, UARS and 
Freja. The MFEDAS provides visual access to these 
data which total more than 50 gigabytes in size. 
Figure 1 is an artist’s conception of the field 
aligned or Birkeland current system which connects 
the Earth’s ionosphere in the northern and southern 
auroral zones to the magnetosphere and solar wind. 
The figure shows the cone-shaped model of 
Birkeland currents with the accompanying 
horizontal electrojet current system. These current 
systems were discovered by APL spacecraft in 1 963 
and subsequently studied by using magnetic field 
disturbances measured by spacecraft-borne 
magnetometers. The MFEDAS has been developed 
to efficiently extract information on these current 
systems from the instrument telemetry stream, 
perform the transformations needed for analysis , 
remove any background signals or noise that mask 
the features of interest, and display the analyzed 
data. The MFEDAS system combines and 
automates these tasks for efficient, individual 
event analysis and for rapid, automatic large 
scale processing. 

The evolution of MFEDAS has been driven by 
two factors. The first is the increased rate of sam- 
pling frequency and overall data volume (TRIAD 
was 3 axis, 13 bits per sample and 1/2 Hz sampling, 
Freja is 7 channels, 16 bits per sample, 128 or 256 
Hz sampling). The second factor is the need to 



24 


Visualization Techniques in Space and Atmospheric Sciences 



DMSP A HILAT 

8 FEB 86, 2 JAN 87 4 13 MAR 89 


ELD ALIGNED CURRENT MAGNETIC 
DISTURBANCES 


STORMS: 

. ACTIVITY: 


MARCH 1989 
FEBRUARY 1986 
JANUARY 1987 


Figure l.An artist ‘s conception of the ionospheric current systems. The graph shows an example of spacecraft data measurements 
produced by these systems. 


efficiently analyze data from a large number of 
satellites. During the design of MFEDAS, specific 
data set and application software commonalities 
were determined. These common elements have 
been incorporated into standard data structures, 
allowing the construction of modular, expandable 
software tools. Because of these standards, existing 
software becomes the basis for extensions of the 
processing and analysis software set and for the 
addition of the software required for new data sets. 
This software reuse allows these extensions to be 
rapidly completed with a minimum of redundant 
effort (e.g., FREJA required 3 staff months to 
incorporate into MFEDAS). 

The first section below describes the logical 
steps in processing and analyzing these data sets. 
The next section describes the software required. 
The final section gives some examples of applica- 
tions of the MFEDAS. 


2. MAGNETIC FIELD DATA PROCESSING 

AND ANALYSIS 

The MFEDAS analysis of magnetic field data is 
a process of several logical steps. The raw instru- 
ment data (in units of instrument counts) is con- 
verted to real physical units (typically nanoTeslas) 
by applying calibration and offset corrections. A 
time is associated with each magnetic field mea- 
surement. Erroneous data points are detected and 
data error handling is performed. Early quality 
evaluation and data screening are performed to 
reduce the amount of redundant error checking that 
would otherwise be needed in later routines. The 
resolution of the data may be reduced as part of the 
initial access to reduce the volume of the data to be 
analyzed. The measured data is rotated to an 
orthogonal XYZ instrument coordinate system and 
converted from instrument to science units. The 
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values for the conversion factors are estimated by 
measurements on the ground and refined by using 
measurements during the mission. Once the data are 
in science units, they are rotated into the standard 
spacecraft referenced coordinate system. The final 
data transformation is made from the spacecraft 
reference system into an Earth or star-fixed refer- 
ence system by applying transformations that 
incorporate spacecraft attitude and location data. 

Analysis often includes the tasks of separating 
the ionospheric induced fields from the measured 
field which is a composite of the main geomagnetic 
field, the ionospheric induced fields, and other 
noise sources within the instrument and spacecraft. 
The first step in separating the field disturbances 
from the measured values is the subtraction of a 
model of the geomagnetic field. The spacecraft 
position and attitude must both be well known to do 
this, because the disturbances are only a small part 
of the measured field on the order of a few percent 
of the Earth’ s field. The data are transformed into 
common coordinates and common displays to allow 
the data to be compared with other data sets and to 
best display specific features. 


3. DATA PROCESSING 

The MFEDAS provides the software tools needed 
for magnetic field experiment data set proces sing and 
analysis tasks described in the preceding section . 
Figure 2 shows the two major processing steps of the 
system and data flows through the system. The first 
step contains satellite unique telemetry processing 
software designed to allow users to read and summa- 
rize the resulting data and to obtain uniform and rapid 
data conversion to a standard format. The second 
major processing step, performed by the program 
Magplot, includes the set of general routines to 
perform the tasks required for data analysis and display 
for all data sets. Magplot includes common file access 
and file manipulation routines, analysis software 
including filtering, averages, fits, noise reduction and 
FFT’ s and displays. The display options include 
multiple data item plots, polar plots and spectrograms. 
Magplot has internal standards for magnetic field data 
structures, ephemeris data access, and for calling 
parameters. These standards are the basis for reuse of 
the existing software for new data sets and for 
extensions to the software set. 



Figure 2. The figure shows input data sets, processing modules and products of the MFEDAS. On the left are the satellite unique processing 
software modules. The central part of the MFEDAS is the generic software in the program Magplot, which can be reused for new: data sets. 
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Each new data set requires a unique software 
data extraction module to be created to deal with 
the variations in the input data sets. Satellite data 
sets are made available on a range of physical 
media and in unique satellite telemetry stream data 
formats. The extraction software contains the 
locations and data types of the required fields in the 
input data stream and the operations required to 
convert these fields to a usable value. These opera- 
tions include bit, byte and word access and conver- 
sion to local types, real data representation conver- 
sions, and other data type conversions. The large 
input data stream size requires that these operations 
be done in an efficient manner. Developing this 
software can be the most time consuming part of 
providing access to a new data set. 

A planned extension to the MFEDAS is a 
generalized utility that uses a graphic interface to 
interactively create the mapping between the data 
stream and standard format output files. This utility 
would make available an extensive set of tools for 
these operations. The utility would remove the 
inefficiencies of individual software creation by 
providing a rapid and self documenting method for 
low level data access. 

Figure 3 diagrams the layers of the logical shell 
structure of the general analysis and display pro- 
gram Magplot. Magplot consists of a core data 
structure and operators on this structure. The three 
segments of operations share the core data struc- 
ture. The operations are accessed through an 
interactive user interface or the command script 
interface. The labels in Figure 3 are described in 
detail in the paragraphs below. 

3. 1 Standard Data Structure and Data Structure 

Handling 

All routines use a common data structure. This 
structure consists of a descriptive header and a 
multiple record data area. The header consists of 24 
integer descriptive values that encode the type of 
data, satellite that creates the data, coordinate 
system, times and further descriptive information. 
The data area consists of a column of time (seconds 
of the day) and up to nine columns of data. A set of 
routines has been developed to display and modify 
the header and to display, scale, and reorder the 
data structure. 



Figure 3. Shell structure diagram showing the logical layers in 
the program Magplot. 


3.2 Data File Handling andEphemeris Data Access 

Routines are provided for input and output of 
standard format data files. Ephemeris data is 
available from within Magplot by standardized calls 
that input the data from disk files. A data base of 
the ephemeris data available is used to provide 
rapid, automatic file selection. 

3.3 Data Analysis and Filtering 

Extensive analysis software is available includ- 
ing filtering, averaging, fits, noise reduction, FFT’ s, 
plots and spectrograms multiple filters, polynomial 
fits, and averages. 

3.4 Graphic Creation and Display Handling 

Displays include multipanel time series plots, 
polar dial plots, and spectrograms. Output display 
device drivers are available for X-Windows, 
TEK4010, QMS Laser printer, and PostScript. 

3.5 User Interactive Interface 

The user interface is flexible, menu driven, and 
rapidly modifiable. The tree structure for command 
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selection allows maximum user flexibility. Figure 4 
shows the first two levels of menus. 

3.6 Command Script Interface 

Prototype command scripts have been set up to 
allow rapid creation of scripts for specific tasks. 
These are used to create specific command scripts 
to be used for production processing. 

The MFEDAS includes a set of modules to 
perform coordinate system transformations for 
standard coordinate systems: geographic and 
geomagnetic; earth centered and local; and Earth, 
solar and stellar fixed. 

4. EXAMPLES OF THE ANALYSIS OF 
MFEDAS RESULTS 

Figure 5 is a composite plot of six of the JHU/ 
APL spacecraft data sets produced by the 
MFEDAS. The plots are polar plots in magnetic 
latitude versus magnetic local time of disturbance 
vectors transverse to the main field with bases 
along the orbit trajectory. The coordinate systems 
shown have one component parallel to the magnetic 
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field and the other two orthogonal components are 
in the North and East or Sun and Dawn directions. 

The UARS magnetic field data of the upper left 
panel shows transverse disturbance vectors indicat- 
ing strong Region 1 (middle cone of Figure 1 ) and 
Region 2 (equatorward cone of Figure 1) Birkeland 
current systems associated with an expanded 
auroral oval (-60° MLAT circle). These large scale 
dynamics are common for the present solar cycle 
maximum. The third line trace of the UARS panel 
is a plot of magnetic disturbances parallel to the 
main field indicating the ionospheric electrojet 
currents drawn schematically as extended horizon- 
tal arrows. The UARS panel is a composite of 
northern and southern hemisphere passes. 

The FREJA panel shows a sample of recent 
data showing the dayside Region 1 and Region 2 
Birkeland current disturbances during an active 
magnetic storm development on December 28, 

1992. Polar cap disturbances have been artificially 
nulled. 

The MAGSAT panel is a composite of three 
polar passes during northward IMF with the trans- 
verse vectors indicating the polar cap NBZ 
Birkeland current system (inner cone of Figure 1 ), 


MAGFIL 

File access routines 

MAGREAD: Input standard format 
magnetometer data 
MAGWRIT: Write a file of the 
current data 

READDAT: Free format file input 
BLST: List header variables 
CMNT: Change header variables 
SAMP: Create sample data set 
SWAP: Move main data arrays to/ 
from secondary arrays 
OAOPS: Call att. data access routines 
OAT AG: Add att. data to data 
OALIST: List all att. data file 
OAINDEX: Access att. data index 
GETOA: List att. data 


MAGPLOT 

MAGPRO 

Data processing routines 

GEOGM: Rotate from satellite to 

geographic coordinates 

FIX: Remove bad data points with 

automatic selection 

SLAV: Sliding average 

AVG: Average 

TAPR: Taper data 

DTRN: Detrend data 

POLFIT: Detrend data using a 

polynomial fit 

FILT: Bandpass or bandstop filter 
using Fourier transforms 
FRTR: Fourier transform 
IFTR: Inverse Fourier transform 
TRNC: Limit time range of data 
POWR: Spectral power density 


MAGPLQTS 
Plotting routines 

XYPLOT: Time vs. B(l-4) or 
Frequency vs. Amplitude 
POLARPLOT:Polar plot & three 

panel T vs. B 

MLATB: Magnetic latitude vs. B 
POL6: Polar vector plots 


Figure 4. Command menus and command names available in Magplot. The First level shows the three program divisions. The 
second level shows command options. 
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which has been proposed to be driven from neutral 
wind inertia and thermosphere-ionosphere momen- 
tum, coupling back to the ionosphere during quiet 
times following magnetic storm activity. The 
magnetic field disturbances parallel to the main 
field indicate the antisunward ionospheric Hall 
current associated with these events and are indi- 
cated schematically. 

The Viking and HILAT panels are correlative 
data from the two spacecraft at the same time. 
Transverse disturbance vectors are displayed which 
infer the large scale Birkeland current system 
during this transition from northward IMF into the 
storm time expansion phase. 

The AMPTE panel shows data mapped along 
magnetic field lines from the CCE equatorial orbit 
on the dayside indicating the ionospheric location 
of magnetosheath/cusp field disturbance signatures. 


Figure 6 shows polar plots of six hours of 
magnetic field data created with a MFEDAS 
extension that uses the information in the AC 
channel parameter file to automatically produce 
plots during periods of high activity. The NASA 
Upper Atmospheric Research Satellite (UARS) was 
launched September 1 2, 1 99 1 , with an APL mag- 
netic field experiment. The unique stability of this 
platform and the full orbit data collection have 
provided geomagnetic observations from UARS 
that approach MAGS AT quality . The standard 
production of the UARS magnetometer summary 
data includes the processing of the 5 - 50 Hz peak 
detector channel. This circuit responds to wave 
phenomena in that frequency range and because of 
the wave turbulence attendant to the Birkeland 
currents and the high frequency power in the 
Birkeland current signature discontinuities. This 


UARS Magnetometer Data April 5, 1993 
Northern Hemisphere 



Figure 6. Polar plots of six hours of magnetic field data from the UARS spacecraft showing magnetic field activity during 
successive orbits on April 5,1993. The time periods for these plots were selected automatically from the activity index values and 
the command script interface to Magplot was used to control file access and plotting. 
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AC channel serves as an indicator of spacecraft 
passage into the auroral zone, reflecting both the 
intensity of the currents and their duration. During 
automatic production processing, a full day plot is 
produced and a condensed on-line ASCII file of AC 
channel parameters is created, which gives start and 
stop times and locations of auroral zone crossings 
above a significant threshold level. The FREJA 
magnetic field experiment has a similar “event 
detector” whose data can be received real-time or 
transferred via network from Sweden. 

5. SUMMARY 

The MFEDAS is a comprehensive system for 
efficiently accessing and analyzing magnetic field 
data. The system is an example of software reuse 
where the sophistication and breadth of the tasks 
that the software performs have grown with the 
number of data sets available to the system. This 
system growth is achieved by maintaining standard 
data structures and interfaces. This has allowed the 
accumulation of knowledge about the requirements 
of magnetic field data analysis to be incorporated 
into the MFEDAS software. 

The MFEDAS has successfully integrated the 
nine scientific magnetic field data sets into a 
common analysis system. Although the initial 
development of this system took a typical amount 
of resources and level of effort, subsequent data 
sets, such as UARS and Freja, have been integrated 
for a small fraction of a staff-year and using mini- 
mal resources. The built-in versatility has proved 
invaluable to troubleshooting data, spacecraft 
problems, and variations inevitable with each new 
data set. The MFEDAS is a permanent resource, 
providing new projects with a quick jump start to 
data analysis and science investigations by decreas- 
ing software development time to a minimum and 
reducing costs to sponsors. 
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Visualization tools are being developed to meet the 
challenges of mission planning and data analysis 
presented by the International Solar-Terrestrial Physics 
(ISTP) program. ISTP encompasses a large number of 
spacecraft, multiple ground-based observatories and 
several theoretical investigations, with the goal of 
understanding the global behavior of the solar wind- 
magnetosphere-ionosphere system. The tools include 
three-dimensional displays of key boundaries in geospace 
along with spacecraft trajectories, which can be animated 
and synchronized to universal time. Magnetic field 
models and MHD simulation results can be invoked to 
reveal the magnetic topology or to identify magnetic 
conjunctions between spacecraft and/or ground-based 
facilities. Simultaneous displays of satellite trajectories, 
spacecraft-borne observations, and model predictions 
are available to facilitate data processing and 
interpretation efforts. The current status of these tools is 
described, and their implementation at the ISTP Science 
Planning and Operations Facility and distribution to the 
entire ISTP community are discussed. 


1. INTRODUCTION 

The International Solar-Terrestrial Physics 
(ISTP) program seeks to understand the physical 
processes that transport mass, momentum and 
energy from the Sun, through the interplanetary 
medium, into and through the Earth ’ s magneto- 
sphere, and finally into the ionosphere. Specifi- 
cally, the program focuses on the study of plasmas 
and magnetic fields of the Sun, the Earth, and the 
space in between. The program involves simulta- 
neous, coordinated measurements from key areas of 
geospace. The scope and magnitude of the ISTP 
mission demand a sophisticated set of visualization 
tools to perform operational planning and coll abo- 
rative analysis of data from space-borne instru- 
ments, ground-based observations, and empirical or 
theoretical models. 

Visualization of ISTP data is particularly 
complicated in two respects. First and foremost is 
the inherently global nature of the project, which 
demands integrated views of the data. Scientists are 
faced with the challenge to visualize and analyze 
data collected by different instruments onboard 
multiple spacecraft situated in vastly different 
regions of geospace, and to integrate these data with 
those obtained from ground-based or ionospheric 
measurements. While theory and modeling efforts 
provide the “maps” to facilitate comparisons and 
calibrations of observations from disparate sources, 
interactive visualization tools are essential to 
understanding the spatial and temporal relation 
between the observations in the context of the entire 
solar wind-magnetosphere-ionosphere system. 
Access to effective scientific visualization tools is 
now possible due to the development of powerful 
workstations. The second factor complicating the 
visualization of ISTP data is the unprecedented 
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volume and diversity of data expected from ISTP 
investigations. The ISTP datapool includes observa- 
tions from ISTP satellites, as well as data from other 
spacecraft missions which have established research 
collaborations with the ISTP program. In addition, 
there are ground-based and theory investigations. 
Complementary data will be integrated through 
coordinated science campaigns with other maj or 
international science initiatives. All together, some 
twenty spacecraft, tens of ground-based observatories, 
and several theory investigation groups will provide an 
estimated 2.2 gigabytes of data per day [seeMish et 
al., 1 994 for a detailed description of the ISTP data 
systems and products] . In this communication, we 
describe the current capabilities of the Visualization 
Tools developed for the ISTP program, their imple- 
mentation at the ISTP Science Planning and Opera- 
tions Facility (SPOF) [Peredo, 1993], and enhance- 
ments currently underdevelopment. 

2. THE ISTP PROGRAM 

As indicated above, the ISTP program is an 
international, multi- spacecraft project aimed at 
developing quantitative and predictive models of 
the flow and transfer of particles, momentum, and 
energy from the Sun, through interplanetary space 
into the magnetosphere and eventually into the 
Earth’ s ionosphere. To accomplish these goals, the 
ISTP program involves simultaneous coordinated 
observations throughout key regions of geospace 
including the polar auroral regions, the equatorial 
magnetosphere at geosynchronous distance, the 
geomagnetic tail, the magnetosheath and the solar 
wind. Complementing the in-situ spacecraft 
measurements are remote sensing data from 
ground-based observatories, and model and simula- 
tion results from theoretical investigations. 

Spacecraft observations of the Sun will be 
provided by SOHO; the solar wind state will be 
primarily monitored by the WIND spacecraft; and, 
IMP-8 will provide additional solar wind data 
during a portion of its orbit, and magnetosheath, 
and tail observations in the remainder of each orbit. 
The POLAR and INTERBALL- AURORA space- 
craft will concentrate on the Earth’ s polar auroral 
regions, while the GEOTAIL and INTERBALL- 
TAIL probes will sample the geomagnetic tail. The 
equatorial region is being covered by existing 
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missions from the DOE Los Alamos N ational Labora- 
tory spacecraft and the NOAA Geostationary Opera- 
tional Environment Satellite program. The four- 
satellite CLUSTER mission will provide detailed 
observations of small-scale, fast phenomena in the 
magnetosphere, magnetosheath, and solar wind. 

Ground-based observations will be contributed 
from the Canadian Auroral Network for the Origin 
of Plasmas in Earth’ s Neighborhood Program 
Unified Study (CANOPUS), by the Satellite 
Experiments Simultaneous with Antarctic Measure- 
ments (SESAME), the Sondrestrom Radar, and the 
Dual Auroral Radar Network (DARN). A wide 
variety of ground-based instruments are used to 
measure, among other things, particle precipitation 
and flux, magnetic-wave intensity and polarization, 
plasma velocity and convection, and electric fields. 
The ground-based observations serve a dual pur- 
pose. They provide remote sensing of the magneto- 
spheric space separating the spacecraft, while also 
providing high-latitude ionospheric measurements 
that serve to define key boundary conditions 
required to couple the ionospheric and magneto- 
spheric components of theoretical models. 

An innovative aspect of the ISTP program is 
the role played by theory and modeling investiga- 
tions. The theoretical models provide a framework 
for understanding the spatially scattered and diverse 
spacecraft and ground-based observations. The 
observations in turn provide a standard to evaluate 
and improve the models. Theoretical investigations 
are being pursued on three levels. The global 
structure of the solar wind-magnetosphere system is 
being modeled by large scale magnetohydrody- 
namic (MHD) computer simulations and by empiri- 
cal magnetic field models. The structures of 
important boundary layers, such as the magneto- 
pause, are being studied using hybrid simulation 
codes and analytic models. The microscale plasma 
processes that may determine mass, momentum, 
and energy exchange across these structures are 
being investigated with kinetic and fluid plasma 
theory and computer simulation. 

3. THE ISTP/SPOF 

The ISTP Science Planning and Operations 
Facility (SPOF) [Peredo, 1993] is the component of 
the Global Geospace Science program responsible 
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for the development and coordination of ISTP 
science planning operations. The SPOF operates 
under the direction of the ISTP Project Scientist and 
is responsible for the development and coordination 
of the science plan for ISTP spacecraft. SPOF 
executes ephemeris and model software to generate 
science planning and event times, and provides the 
link for WIND and POLAR scientists to submit 
experiment operating instructions. The SPOF 
reviews Key Parameter data (low resolution, ~ 1 
min., survey data), and aids investigators in the 
identification of intervals of scientific interest for 
detailed study. SPOF scientists also collaborate 
with ISTP Theory Investigators on the develop- 
ment, evaluation, assessment, and comparison of 
magnetospheric models. While the SPOF worksta- 
tions are restricted to SPOF personnel, tools devel- 
oped for mission planning and data analysis, 
including those described in this communication, 
will be made available — without any support 
commitment — to the entire ISTP community via the 
ISTP Central Data Handling Facility. The tools 
described in this communication will also be 
available via anonymous ftp from avl.umd.edu. 

Note that interested users must have the appropriate 
commercial licenses for AV S , PV- Wave and IDL in 
order to run the tools. 

4. THE ISTP VISUALIZATION TOOLS 

Understanding the critical role of visualization 
for ISTP, the SPOF, and the University of Mary- 
land Advanced Visualization Laboratory [Goodrich 
andMcNabb, 1992] are jointly working to develop 
interactive tools to meet the science planning and 
data analysis needs of ISTP [Goodrich et al., 1993, 
McNabb et al., 1993], These tools are based on 
widely used, modular and extensible applications 
including the Application Visualization System 
(AVS) [Upson et al., 1989], the Interactive Data 
Language (IDL) [IDL, 1993], and PV - W ave [PV - 
Wave, 1993]. The modular nature of these pack- 
ages means that ISTP investigators can combine 
efforts to build better tools. The SPOF and the 
AVL act as the central point for integration and 
distribution of the tools, but modules developed by 
other scientists can easily be incorporated into the 
tools. While the tools have been developed and 
tested on SUN and DEC unix workstations, the 
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highly portable nature of AVS, IDL, and PV-Wave 
should make it easy to install and ran on any 
platform for which these products are available . 

Current capabilities of the ISTP visualization 
tools include generation of integrated three-dimen- 
sional displays of satellite trajectories, empirical 
models of key magnetospheric boundaries, and 
magnetic field lines from empirical models or 
magnetohydrodynamic simulations. The spacecraft, 
boundaries and magnetic flux tubes may be ani- 
mated, synchronized to universal time, and the 
ionospheric foot points may be displayed interac- 
tively. Simultaneous display of in situ, ground- 
based and simulation data are readily incorporated. 
In the following sections, we describe three AVS- 
based prototype applications that illustrate the 
current state of the ISTP visualization tools. In 
addition, we discuss their role for mission planning 
and data analysis functions at the ISTP/SPOF. 

4. 1 Spacecraft and Magnetospheric Regions 

Display 

In order to meet the global needs of the ISTP 
program, our visualization tools allow the user to 
generate three-dimensional displays of spacecraft 
trajectories within the context of the entire 
geospace system. We have constructed three- 
dimensional surfaces representing boundaries 
between key magnetospheric regions . These 
boundary representations are largely based on 
empirical, data-based models and are compatible 
with those used by the NASA Satellite Situation 
Center (SSC) [SSC, 1992;Peredoetal., 1992a.; 
Parthasarathy etal., 1994; Aist-Sagaraetal., 1993], 
The SSC tools are used extensively at the SPOF and 
are complementary to the visualization and data 
analysis tools described in this communication; as 
such, use of compatible definitions for the key 
regions of geospace is of utmost importance. The 
boundary representations have been revised and 
updated extensively over the last two years. Our 
visualization tools employ an important subset of 
the regions defined for SSC applications; the three 
key boundaries included in our three-dimensional 
magnetospheric displays are: (1) the magnetopause 
model of Sibeck et al. (1991), (2) a modified 
version of the Fairfield (1971) bow shock model ; 
this modified version was introduced as part of the 
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update of the SSC applications; the original bow 
shock surface derived by Fairfield is modified by 
allowing the surface to move along the Sun-Earth 
direction in response to variations in the solar wind 
dynamic pressure subject to the constraint that the 
bow shock stand-off distance is 1 .3 times the 
magnetopause stand-off distance (see SSC, 1 993 for 
details), and (3) the neutral sheet model of Fairfield 
(1980). Close interaction exists between the SPOF 
and SSC groups, and any future updates or revi- 
sions to the magnetospheric region definitions in 
the SSC and visualization tools will be coordinated. 
Users may replace these boundary representations 
with those of their choice by creating AVS geom- 
etries corresponding to the model(s) of interest and 
using our existing modules as example code. 

Our first example AVS application produces 
the three-dimensional spacecraft and geospace 
displays discussed above. In addition to spacecraft 
trajectories and the magnetospheric boundaries, our 
displays may include orienting features, such as 
continental outlines on the Earth and either Geocen- 
tric Solar Magnetospheric (GSM) or Geocentric 
Solar Ecliptic (GSE) coordinate axes. Satellite 
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trajectories are readily generated from predictive or 
definitive ephemeris files in the Common Data 
Format (CDF) adopted by ISTP for many of its data 
products. Displays can be easily animated to 
illustrate the motion of the satellites and the dis- 
placement of the boundaries in response to varia- 
tions in the model parameters, including solar wind 
dynamic pressure or in the orientation of the Earth’s 
magnetic dipole. For these animations, all objects in 
the display are synchronized with respect to univer- 
sal time. 

Using this AVS application, we can interac- 
tively visualize and identify advantageous space- 
craft configurations. This application will be used 
during mission planning activities to identify 
potential windows of opportunity to address spe- 
cific science questions, and the experimenters will 
be alerted to configure their instruments accord- 
ingly. During data analysis efforts, this AVS 
network will help in the interpretation of simulta- 
neous observations from multiple spacecraft-borne 
and ground-based instruments. Figure 1 shows a 
sample predicted configuration including the 
GEOTAIL, INTERBALL-TAIL, INTERB ALL- 



Figure 1 . Sample spacecraft configuration for global studies of magnetotail dynamics. IMPS ( green ) acts as the solar wind monitor, 
INTERBALL-A URORA ( yellow ) monitors the polar auroral region, GEOTAIL (pink) samples the deep tail while INTERB ALL-TAIL 
(dark red) explores the high-latitude magnetotail; the moon is shown (larger red sphere) for reference. 
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AURORA, and IMP-8 spacecraft. The image was 
generated during a planning session for the First 
Science Campaign of the Inter-Agency Consultative 
Group [Whipple and Lancaster, 1 992], and illus- 
trates a desirable configuration for magnetotail 
dynamics investigations. IMP-8 provides solar wind 
information, AURORA samples the polar cap 
regions while TAIL and GEOTAIL cover, respec- 
tively, the high-latitude and deep geotail regions. 

The AVS network used to create Figure 1 
contains the module we have developed to generate 
the surfaces corresponding to the key geospace 
boundaries detailed earlier; entire surfaces may be 
shown, or half-surfaces can be displayed where the 
slice is in the GSM x-z plane. Each surface is a 
separate geometric object, and the AVS built-in 
functionality to modify object properties can be 
used to individually assign color and transparency 
properties to yield the preferred view. This surface 
module has been enhanced to conceptually display 
th e windsock effect [Hones etal., 1986] illustrated 
in Figure 2, and thought to occur as a change in 
solar wind direction upstream of the magnetosphere 


reaches the magnetospheric boundary layers and 
propagates downtail along the magnetosphere. The 
location, angle, and extent of the solar wind distur- 
bance can be controlled interactively, and it is easy 
to produce an animated three-dimensional display 
of the propagation effect. The qualitative represen- 
tation in Figure 2 actually agrees well with the 
quantitative simulation study of the windsock effect 
by Goodrich and Ly on [ 1 992] . 

Each of the satellites shown in Figure 1 (and 
their trajectories) are generated by a copy of the 
satellite macro module that reads the ephemeris 
data, automatically performs the necessary time and 
coordinate transformations, and produces two 
output fields : the spacecraft position at the selected 
universal time, and the list of points corresponding 
to a trajectory segment “about” the current position. 
The user can interactively control the duration of 
the trajectory segment. A placement slider param- 
eter selects the portion of the trajectory segment 
appearing ahead of and behind the spacecraft. 
Another custom module allows selection of the 
current universal time, and permits proper synchro- 



Figure 2. Simulation of the Windsock Effect. A change in direction of solar wind velocity produces a kink that propagates along 
the bow shock, magnetopause and neutral sheet surfaces. Our A VS application allows interactive control of the extent and 
sharpness of the kink to simulate specific variations in the upstream conditions. 
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nization of all objects in the display. A library of 
conversion routines has been created to facilitate 
conversions between different formats commonly 
used by the NASA community to represent time. 
The library also includes routines for time-arith- 
metic necessary for efficient data searches and 
selections. A macro module composed of public 
domain modules is used to display the Earth as a 
sphere with overlaid continental outlines and 
latitude and longitude contours. The interactive 
three-dimensional controls of AVS are especially 
helpful in this network when examining in detail 
such events as the crossing of one of the surfaces by 
a satellite. The user can easily rotate, translate, or 
zoom the display to view the event from multiple 
angles and assess its importance. 

4.2 Interactive Display/Identification of Magnetic 

Conjunctions 

The second AVS application we discuss is 
based around our magnetic field line tracing mod- 
ule. The underlying code includes routines for the 


widely usedTsyganenko models [Tsyganenko 
1987, 1989, 1990], as well as improved variants of 
these models recently derived [Peredo et al . , 1 992b ; 
Peredoetal., 1993; Stem and Tsyganenko, 1992]. 
The module permits generation of three-dimen- 
sional displays of magnetic field lines traced from 
one or more seed points. Synchronization to univer- 
sal time is inherently built into the network. One 
use of the module involves tracing lines from a 
large number of seed points on the surface of the 
Earth out into the magnetosphere; the resulting 
display shows the global structure of the magnetic 
field according to the selected model, and may be 
used to identify graphically the boundary between 
closed (both ends reach the Earth) and open (only 
one end reaches the Earth) field lines. An alterna- 
tive use is to select a spacecraft position as the seed 
point, and to trace the magnetic flux tube passing 
through the spacecraft down to the Earth’s iono- 
sphere, as illustrated in Figure 3 . Such a display, 
combined with an overlay of ground-based observa- 
tory locations and continental outlines on the Earth, 
readily provides an interactive tool for visual 



Figure 3. Magnetic conjunction between the POLAR ( red) spacecraft and a ground-station of the CANOPUS array. ( red dots on the 
surface of the Earth); the SAMPEX probe (green) maps to higher latitudes. Field line traces are according to the Tsyganenko 89 
magnetic field model corresponding to highest magnetic activity level (Kp>5). 
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identification of magnetic conjunctions. These 
conjunctions, either between a spacecraft and a 
ground station, or between multiple spacecraft hold 
special scientific significance since charged par- 
ticles travel preferentially along magnetic field 
lines, and thus magnetic conjunction times are 
prime candidates for effective correlative studies 
between ground-based and spaceborne experiments. 

4.3 In Situ Data Display/Comparison 

To provide researchers with powerful integrated 
visualization and analysis tools, we complement the 
three-dimensional spacecraft and boundaries 
displays with simultaneous time-series plots com- 
paring predicted theory or model “data” to the 
actual measurements obtained by the spacecraft. 

As with our previously described applications, the 
spacecraft and boundaries motion, and the model 
and observed data on the time-series displays, are 
all synchronized to universal time. 

To display the time-series data, we use AVS 
modules developed with the SAVS program 
[Szuszczewicz, et al., 1994] that call PV-Wave and 
use its functionality, in this case to produce stacked 
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line plots. The modules are linked to PV-Wave at 
compile time, exchange data and communicate with 
PV-Wave through shared variables, and directly 
call PV-Wave routines to execute arbitrary com- 
mand language statements. Because the ISTP 
community is divided among their access to PV- 
Wave or IDL, we are working with Research 
Systems Inc. to develop a similar module that 
communicates with IDL. An illustration of our 
joint displays appears in Figure 4, where magnetic 
field components observed by IMP-8 are compared 
with the results of an MHD simulation. For sim- 
plicity in this example, we have interpolated the 
simulation results from a single time step along the 
IMP-8 orbit. The field components show very good 
general agreement, despite the limitation of actually 
comparing spatial variation in the simulation results 
with the time series of IMP- 8 data. Though more 
arduous, we can and will be including the time 
dependence of the MHD results in these compari- 
son. This application allows superposition of data 
and results from several sources; such data may 
come from empirical models such as the 
Tsyganenko magnetic field models [Tsyganenko, 
1987, 1989, 1990], from numerical simulations, 



Figure 4. Simultaneous Display of Model/Spacecraft Data and three-dimensional Magnetospheric Configuration. The three- 
dimensional display shows IMPS exiting the magnetosphere on the dawn side, while the stacked data plots show the magnetic field 
components observed by the spacecraft ( white) and predicted by an MHD simulation ( blue). These data plots were generated with 
PV- Wave using data imported from A VS via the PV- Wave/ A VS module developed with the SA VS program [ Szuszczewicz, et al. , 1 994 ]. 
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such as the MHD simulations developed by the 
ISTP Theory Group [Raeder, 1992; Convery et al., 
1992; Goodrich and Lyon, 1992; Lyon and Fedder, 
1992] or from actual spacecraft observations. 

5. SUMMARY AND FUTURE PLANS 

A suite of visualization and data analysis tools is 
under development for the ISTP project. Several 
prototype applications have been created and are in 
active use by the ISTP/SPOF. Distribution of these 
tools to selected ISTP investigators with AVS 
experience is currently underway. Their experience 
with the tools and feedback will be used to deter- 
mine additional features desirable to tailor the 
applications to the true needs of space scientists. In 
addition, we plan to incorporate enhancements 
suggested by our own use of the tools within the 
SPOF and ISTP Theory Groups. For example, we 
are developing more sophisticated CDF-readers to 
handle multiple ephemeris data sources and diverse 
key parameter data from all ISTP investigations. 
Future plans also include seamless integration 
between AVS, IDL, and PV-Wave to allow data 
exchange between any of these products already 
widely used by the space physics community. 
Finally, we plan to expand our currently limited 
distribution of the tools to include all members of 
the ISTP community who request these tools. 
Distribution of the tools is currently on an “as-is” 
basis, with minimal documentation and no support 
commitment. 
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During the time from December 1991 through March 
1 992, there were four operational DMSP satellites in 
polar orbit. All four satellites carried the SSIES 
plasma package which included an ion drift meter. 
Data from the drift meter, combined with the magnetic 
field data, allowed the calculation of the electrostatic 
potential in the ionosphere along the satellite ’s path. 
Simultaneous polar coverage by four satellites was 
unprecedented, providing researchers with almost con- 
tinuous monitoring of the potential distribution in both 
hemispheres for the four month period. Combining the 
magnitude and location of the potential data from each 
of the four satellites in order to examine the varying 
potential distribution pattern in both hemispheres pre- 
sented a major challenge in data visualization. The 
problem was solved by developing a three-dimen- 
sional presentation of the data where the potentials are 
color coded and represented by the vertical dimension. 

This paper presents examples from a computer anima- 
tion of several days of data demonstrating evolution of 
the size and shape of the potential distribution, along 
with how these changes correspond to variations in 
other geophysical parameters, such as the IMF orien- 
tation and the K p index. 


1. INTRODUCTION 

The National Center for Supercomputing 
Applications (NCSA) produced video “Numerical 
Modeling of a Severe Thunderstorm” [Wilheimson 
et al., 1989] is perhaps the best known example of 
the application of computer animation to present a 
large and complex scientific data set in a clear 
manner. As such, it sets a standard that many 
scientists strive to equal in the animated visualiza- 
tions of their own results. However, most scientists 
and researchers do not have access to the high-end 
computers and video resources necessary to pro- 
duce such impressive animations. This paper will 
describe a system developed at the Center for Space 
Sciences at the University of Texas at Dallas for 
producing high quality stills and animations using 
relatively inexpensive equipment. More impor- 
tantly, this paper will also demonstrate how these 
animations and visualizations are applied to the 
large data sets used in the analysis of the shape and 
evolution of the electrostatic potential distribution 
in the polar regions of the Earth’s ionosphere. 

2. BACKGROUND 

The Center for Space Sciences builds an ion 
drift meter instrument as part of the Special Sensor- 
Ions, Electrons, Scintillation (SSIES) package, 
which flies on the polar-orbiting Air Force weather 
satellite series (the Defense Meteorological Satellite 
Program or DMSP) at an altitude of 800 km. This 
instrument measures the bulk ion velocities in the 
horizontal and vertical directions perpendicular to 
the satellite’s velocity vector. The ion flows are 
sampled six times per second for each component 
and then averaged into four-second bins. As the 
satellite flies over the polar region, this four-second 
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resolution flow data is combined with the magnetic 
field data to calculate the electrostatic potential 
along the satellite’s track. A more detailed descrip- 
tion of the analysis techniques used here is given in 
Hairston and Heelis [1993] and Heelis and Hairston 
[1990]. The interaction of the interplanetary 
magnetic field in the solar wind as it moves past the 
Earth’ s magnetosphere generates an electric field 
across the surface of the magnetopause, which in 
turn produces a potential drop between the 
dawnside and the duskside. This potential drop is 
mapped down to the two polar ionospheres produc- 
ing a potential difference there on the order of tens 
to hundreds of kilovolts. The exact distribution of 
the potential across the polar cap region is compli- 
cated and has been studied and modeled for the past 
twenty years [e.g. Heppner and Maynard, 1987; 
Heelis, 1984; and the references therein]. There is 
general agreement in these studies that when the 
interplanetary magnetic field (IMF) in the solar 
wind is pointed southward (i.e. B z <0) a region of 
positive potential forms on the dawnside of the 
polar region and a corresponding region of negative 
potential forms on the duskside. The shapes and 
sizes of these regions vary widely, but are believed 
to be influenced by the magnitude and sign of the y- 
component of the IMF. Here the y-axis is oriented 
parallel to the Earth’ s orbit with +y directed to- 
wards the duskside. When the IMF is oriented 
northward (i.e. B z >0), there is less agreement about 
the nature of the distribution [see Reiff and Burch, 
1985] but it is generally accepted that the potential 
drop is smaller and the overall potential distribution 
pattern more complicated. While the potential 
perpendicular to the magnetic field in the polar 
region covers a two-dimensional area, the data from 
the satellite only covers a single slice of the pattern 
as the satellite crosses the pole. Thus, all of the 
modeling of the potential pattern to date has been 
based on averaging and binning of data from 
satellite passes at widely different times [e.g. Rich 
and Hairston, 1993; Heppner and Maynard, 1987]. 

The ideal situation would have multiple satel- 
lites in orbit simultaneously, each sampling a 
different region of the pattern. This would insure 
that a true representation of the global distribution 
of the potential could be constructed. Such an ideal 
situation occurred from December 1991 through the 
end of March 1992, when there were four opera- 


tional DMSP satellites in orbit, each in a different 
local time orientation. This provided an unprec- 
edented opportunity to map the overall electrostatic 
potential distribution simultaneously in both polar 
ionospheres and look at short term changes in these 
patterns. An initial analysis was conducted on a 
four-day time period from January 26 through 
January 30,1 992, a time period that also encom- 
passed one of the extended Geospace Environment 
Modeling (GEM) campaign periods. 

3. DESCRIPTION OF THE GRAPHICS 

The overall ionospheric potential distribution in 
one hemisphere can be visualized as a deformed 
three-dimensional surface over the polar region 
(Figure 1). Here the magnitude of the potential at a 
given point is represented by the distance above or 
below a polar dial. The polar dial denotes the 
magnetic local times (MLT) and the circles of 
magnetic latitude down to 50 degrees. In this 
figure, the positive potential region is shown as the 
elevated ridge on the right while the negative 
potential region is shown as the bowl-shaped 
depression on the left. To further reinforce the 
distinction between the positive and negative 
regions, they are also color-coded using grey-white 
for positive and orange-brown for negative. The 
potential curve observed by a single satellite pass is 
presented in the figure as the bright white and 
orange line along the surface, thus giving a cross- 
section of the surface along that track. By visualiz- 
ing the electrostatic potential distribution as this 
“drumhead” surface, the variations in the potential 
can be represented as variation in the size and shape 
of the “drumhead.” 

Since the potential data are available only along 
the satellite tracks, three-dimensional plots of the 
potential curves for all four satellites are overlaid 
for each hemisphere to form a rough, wireframe 
representation of the size and shape of the overall 
potential“drumhead.” Figure 2 presents atypical 
frame from the animation showing an example of 
the data from both hemispheres during this period. 
Each frame represents a timestep of twenty minutes 
centered on the time given in the upper left corner. 
The northern and southern polar dials are presented 
as flat surfaces observed from the nightside looking 
towards the dayside. The polar dials show the most 
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Figure I. This view shows the northern polar region of the Earth as seen from the nightside. The blue polar dial represents the ma gnetic 
latitude in increments of 10 degrees and the crosshair represents the magnetic local times of midnight, 6 am, noon, and 6 pm. The 
translucent surface represents the electrostatic potential distribution in the northern polar ionosphere with the magnitude of the 
potential at any point corresponding to the distance above or below the polar dial. Thus, the positive potential region forms a grey- 
white ridge on the downside and the negative potential region forms an orange-brown bowl-shaped depression on the duskside. A 
sample potential curve that would be observed by a single satellite pass is represented as the bright curve across the surface. 


recent passes in each hemisphere for all four 
satellites. Since the orbital periods of all four 
satellites are around 105 minutes, this means the 
pattern in each hemisphere is composed of passes 
of differing ages. In order to differentiate between 
the most recent passes and the older passes, the 
potential curves are color-coded [Tuft, 1990]. A 
new pass is one in which the satellite reached its 
highest magnetic latitude during the twenty-minute 
period of the current frame, and is colored the 
brightest. A pass that occurred before the current 
frame, but within the past hour is given a medium 
color, and any pass more than one hour is shown in 
the darkest colors. Thus, in the animation the 
potential curve for a given pass in a given hemi- 
sphere will be bright when it first appears, fade to a 


medium value for the next two frames, then fade to 
a dark tone for another two or three frames before it 
is replaced by the curve from that satellite’s next 
polar pass in that hemisphere. The observed 
maximum and minimum potential on each 
satellite’s pass are represented by vertical red lines 
between the curve and the polar dial. For times of 
southward IMF, these vertical lines roughly denote 
the polar cap boundary on each polar dial. In the 
lower left comer of the frame is a plot of the K p 
index during the four-day period of this data set 
with a vertical bar marking the current time. 

In the northern hemisphere in Figure 2, the four 
potential curves suggest a surface with a large 
dome-shaped positive potential region that extends 
from the dawnside past the noon-midnight line and 
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Figure 2. This is a typical frame from the animation showing the potential curves observed by the satellites in both hemispheres 
presented on polar dial graphs indicating the magnetic local times ( MLT) and magnetic latitudes. Note how the curves mesh together 
to form a wireframe model of the surface describing the overall potential distribution. New passes that occurred during the 20 minute 
period represented by this frame are shown in brightest colors. Previous passes less than one hour old are shown in medium colors 
and passes more than an hour old are shown in dark colors. The vertical red bars indicate the location and magnitude of the maximum 
and minimum potentials observed along each pass. In the northern hemisphere the largest maximum bar represents 38.8 kV at 5 . 7 
MLT and 76. 9 degrees magnetic latitude, while the largest minimum bar represents -61.6 kVat21 MLT and 69. 1 degrees magnetic 
latitude. The yellow arrows on the crosshairs on the dayside of the polar dials represent the averaged magnitude and orientation of 
the IMF in the y-z plane during the time period for this frame. Here By = -18.2 nT and B z = -0.8 nT. In the bottom left corner is a 
graph showing the K p index for this study period with a vertical bar indicating the current time. 


partway into the duskside. The negative potential 
region is shaped like a valley and confined to the 
duskside. The fact that the different colored curves 
merge to give a coherent shape indicates that this 
potential distribution had been steady for some 
time. This distribution is consistent with the 
observed distribution during times when the IMF B z 
is weakly negative (southward) and B^, is strongly 
negative [Rich and Hairston, 1 993; Heppner and 
Maynard, 1987]. At these times the ion convection 
pattern in the northern hemisphere is dominated by 
a large dawn cell. The averaged IMF orientation 
and magnitude in the y-z plane for this twenty 
minute period is given as an arrow in the crosshair 
on the dayside of the polar dials. Note that the 


orientation of the crosshair is what an observer 
would see if looking towards the sun (i.e. +z is 
upward and +y is to the left). There is a time lag of 
about 36 minutes between the time the IMF was 
observed in the solar wind by the IMP-J satellite 
and the time it is displayed on the crosshair. This 
time lag corresponds to six minutes for the solar 
wind transit time between the IMP-J satellite and 
the nose of the Earth’s magnetopause, plus a 30 
minute delay for the entire polar ionosphere to 
respond to the IMF. This 30 minute response time 
of the ionosphere was determined empirically using 
this four-day data set. 

In the southern hemisphere in Figure 2, the 
pattern formed by the three potential curves is quite 
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different. (The fourth curve in the 1400-1600 MLT 
region is flat showing that this satellite pass did not 
enter the auroral region.) Here the positive 
potential region is shaped like a ridge that forms a 
crescent shape on the dawnside while the negative 
potential region forms a large bowl-shaped basin 
extending from the duskside into the dawnside. 

This is also consistent with the observed potential 
distribution in the southern hemisphere during the 
times when the IMF is predominantly oriented in 
the -y direction and a large dusk cell dominates the 
ion convection pattern [Rich and Hairston, 1993; 
Heppner and Maynard, 1987], Having multiple 
satellites providing simultaneous observations in 
both polar ionospheres clearly demonstrates that the 
potential distributions in both hemispheres are not 
mirror images of each other. It should be noted that 
the orientation of both the northern and southern 
hemispheric polar dials in Figure 2 match the 
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orientation seen in Figure 1 (dusk on the left, dawn on 
the right). This orientation was chosen to emphasize 
the comparisons between each side of the pattern and 
its counterpart in the other hemisphere . 

4. ANALYSIS 

Once all the frames were generated for this 
study period, they were connected together on the 
computer so they could be played in sequence as an 
animation. This proved to be an effective means of 
presenting the four days of data from four satellites 
in two hemispheres along with the IMF and K p data 
in a form that was clear and understandable to the 
investigator. The interactive nature of the anima- 
tions allowed the investigator to “play” the data at 
variable rates, to stop and examine a single frame at 
length, or even move backwards through the data. 
This animation presented the evolution of the 



Figure 3. This frame presents a typical distribution seen when the B z component of the IMF had been strongly positive (northward) 
for over an hour. During this twenty-minute period the averaged component values were B y = -1. 5 nT and B z = +4.9 nT. Notice that 
the overall distribution is no longer organized into the two-cell pattern seen in Figure 2. The surface described by these curves is much 
flatter than in Figure 2 indicating that the electrostatic potential differences are much smaller. 
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potential distributions in both hemispheres over 
time and demonstrated periods of both extended 
stability and rapid change. This animation was 
recorded onto videotape for presentation at the 
spring 1993 AGU meeting (see note at the end of 
the article). This data set provided a wealth of 
information that is still being analyzed about the 
behavior of the ionosphere in response to the IMF. 

In this short report, only four cases from the 
four day study period will be presented. Figure 3 
shows a typical pattern observed when the IMF 
orientation is predominantly B z positive (northward 
IMF) . The overall potential drop between the 
dawnside and duskside is much smaller than when 
B z is negative (southward IMF), thus causing the 
surface to appear much flatter than it did in Fig- 
ure 2. Also, each of the potential curves shows 
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multiple oscillations rather than the simple sine 
wave curve seen in the curves in Figure 2. Taken 
together the curves indicate that the overall distri- 
bution is complex in shape and less organized than 
during times of southward IMF. This distribution is 
consistent with other observations during times of 
strongly northward IMF [Cummnock et al., 1992; 
Heppner and Maynard, 1987]. During conditions of 
northward IMF there is little or no reconnection 
between the Earth’s magnetic field and the IMF, 
which accounts for the small potential drop between 
the dawn and duskside. The large scale two-cell 
convection pattern seen during southward IMF 
becomes distorted or breaks up into multiple 
smaller cells during extended periods of northward 
IMF. A satellite crossing through these multiple or 
distorted cells sees multiple reversals in the direc- 



Figure 4. This frame presents a typical distribution when both By and B z are negative and roughly equal in magnitude. During the 
twenty minutes of this frame the averaged IMF components were By = -12.4 nT and B z = -10.4 nT. While the distribution is similar 
in overallform to the one in Figure 2, the rotation of the IMF towards the south changes the sizes of the potential regions. The northern 
positive potential region contracts back to the noon-midnight line compared to Figure 2, and the southern positive potential region 
increases in width. In the northern hemisphere the largest maximum bar represents 48. 2kV at 10.2 MLT and 75.9 degrees (magnetic 
latitude), while the largest minimum bar represents -62. 1 kV at 19.3 MLT and 68.0 degrees. 
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Figure 5. There were no available IMF data during the study period which showed the IMF orientation when By is positive and B z 
is negative. However, this frame, taken from a period of the study when the IMF data were missing, presents a typical distribution 
pattern that is consistent other observations when By is positive and B z is negative. The "wedge " on the IMF crosshair indicates an 
estimate of the orientation of the IMF. During such periods, the positive potential region in the northern hemisphere contracts over 
towards the downside and the positive potential region in the southern hemisphere expands out to past the noon-midnight line. In the 
southern hemisphere, the largest maximum bar represents 65.6 kV at 4.0 MLT and -74.2 degrees magnetic latitude, while the largest 
minimum bar represents -47.7 kV at 18.5 MLT and -67.6 degrees magnetic latitude. 


tion of the horizontal ion flow during its pass, thus 
accounting for the multiple oscillations seen in the 
corresponding potential curve. 

During times of predominantly negative B z 
(southward IMF), the overall convection pattern 
becomes organized into two large cells: a dawn cell 
and a dusk cell. These convection cells correspond 
to the positive and negative potential regions 
(respectively) observed in the potential distribution. 
During times of southward IMF, there is a strong 
interaction between the Earth’s field and the IMF, 
which drives the ion convection cells in the iono- 
sphere and produces the potential drop between the 
dawn and duskside. Figure 4 shows the distribution 
for a frame when B z and B^ had both been negative 
for several hours. Notice that the potential distribu- 


tions are similar to those described in Figure 2. 
However, in this case, the magnitude of B z is 
roughly the same as B y which results in the widen- 
ing of the crescent-shaped positive potential region 
in the southern hemisphere relative to Figure 2. 
Also, there is a corresponding shrinkage of the 
region of positive potential in the northern hemi- 
sphere to where it only reaches to the noon-mid- 
night line [Rich and Hairston, 1993]. 

During times of southward IMF, it is the 
direction of the y-component of the IMF that largely 
determines the shape and extent of the convection 
cells and the overall shape of the potential 
distribution. Unfortunately, there were no IMF data 
available during this study period showing B z 
negative and B y positive. However, Figure 5 shows 
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a frame from a period without IMF data that 
corresponds to the distributions seen at other times 
when B z was negative and B^ was positive. The 
positive potential region in the northern hemisphere 
has now contracted over to the dawnside forming a 
ridge shape while the negative potential region 
forms a shallow basin shape. In the southern 
hemisphere, the positive potential region has 
expanded past the noon-midnight line into duskside, 
confining the negative potential to a small region. 
Notice that the difference between the patterns in 
the two hemispheres is not as great as in the cases 
for Figures 2 and 4. This is consistent with the 
observations in Rich and Hairston [ 1 993] for 
conditions when IB Z I > IB^I. The wedge shape on 
the IMF crosshair indicates the absence of any 
actual IMF data, as well as the estimate that the 


IMF orientation is in the negative B z /positive B >t 
quadrant. 

One effect of examining the data in an animated 
form is that the time history of the previous passes 
make changes in the potential distribution 
particularly easy to identify. Figure 6 presents a 
frame which shows a dramatic change in the 
potential distribution as a result of a change in the 
IMF. In the previous frames, B z was positive and 
B y was strongly negative. The faded passes in the 
northern hemisphere of this frame show that the 
positive potential region was weak and somewhat 
disorganized. The bright pass in the northern 
hemisphere is the only new pass to appear during 
this twenty minute period after B z turned sharply 
negative. This new potential curve clearly shows 
that the positive region had increased in size and 



Figure 6. This frame demonstrates using the animation to identify changes in the potential distribution. The dimmed curves in the northern 
hemisphere are flat and disorganized in the positive potential region, while the negative potential region remains relatively deep and 
organized. This is indicative of the IMF orientation having a large negative B y component and a targe positive ( northward) B z component. 
During the previous twenty minute time period prior to this frame the averaged component values were B y = -8 . 7 nT and B z = +6.3 nT. 
However, the B z component turned negative (southward) at the end of that period and during the current twenty minute period of this frame, 
the averaged component values have changed to B y = -14.4 nT and B z = -9.3nT. The new potential curve that appears during this frame 
in the northern hemisphere shows the dramatic change in the size and shape of the potential distribution pattern. 
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strength compared to the previous potential curves 
in the northern hemisphere. Such events were used 
to quantify the response time of the ionosphere to 
various changes in the IMF as mentioned in 
section 3. 

5. PRODUCTION OF THE ANIMATION 

The animation and figures were produced using 
a V AX 4000 VLC workstation in conj unction with 
a Commodore Amiga 3000 personal computer. The 
processing of the raw telemetry data and the analy- 
sis to determine the electrostatic potential data for 
each pass were performed on the VAX. These 
processed data were used to generate individual 
frames in color on the VAX using a Tektronix 
plotting format. However, there was not an inex- 
pensive way to make the VAX suitable for the 
playback of these individual frames as an anima- 
tion. Even if it were possible, there was no inex- 
pensive way to transfer such an animation from the 
VAX to video. This problem was solved by using 
the relatively inexpensive Amiga. The Amiga was 
hooked into the V AX cluster via ethernet and used 
a public domain terminal emulator called VLT 
[Weinstein, 1991]. The emulator enabled the 
Amiga to convert the plots downloaded from the 
VAX in Tektronix format to the IFF graphics 
format used by the Amiga. Once all the frames 
were downloaded and converted on the Amiga, 
commercially available software was used to string 
the frames together into an animation. This com- 
mercial software also allowed the user to modify 
the colors used and vary the playback speed of the 
animation. Since the Amiga was specifically 
designed as a video/multimedia platform, recording 
the video was a straightforward process of connect- 
ing a genlock (a device that converts the computer’ s 
RGB video output to standard NTSC video signal) 
to the computer, and then videotaping the output. 

6. CONCLUSION 

We have demonstrated in this paper that it is 
possible to create effective and useful scientific 
visualization animations using relatively inexpen- 
sive equipment. These animations enable research- 
ers to see quickly and easily the shapes and sizes of 
the distributions of the electrostatic potential in the 


49 

ionosphere. While these images are not comparable 
to animations produced on high-end systems, the 
overall quality is still quite high and suits the needs 
of the researcher. Most of the work done to date in 
modeling the overall pattern of the electrostati c 
potential distribution has focused on static patterns 
which arise during long periods of stable IMF 
conditions. In reality, the IMF is rarely stable for 
time periods greater than an hour; thus the potential 
distribution in the ionosphere is quite variable and 
does not always match the static models. As 
researchers begin to model this variability and the 
time-dependence of the potential distributions, 
computer animations of the data, such as presented 
here will become an essential tool of that work . 

Note: A copy of the animation of this work (NTSC 
only) will be sent to anyone who sends the authors 
a blank VHS videotape (still in the original wrap- 
per) with a self-addressed stamped return en vel ope. 
Please mail requests to: Marc Hairston, Center for 
Space Sciences, University of Texas at Dallas, PO 
Box 830688 F022, Richardson TX 75083-0688. 

Acknowledgments. The authors wish to thank Dr. 
Ron Lepping of the NASA Goddard Space Flight 
Center and Drs. Barbara Emery and Gang Lu of the 
National Center for Atmospheric Research/High 
Altitude Observatory for providing the IMF data for 
this time period. The authors also wish to thank 
Professor Barry Treu of the Visual Arts Department 
of the University of Texas at Dallas for his assis- 
tance in producing the animation and Dr. Robin 
Coley of the Center for Space Sciences for his 
assistance in this work. This work was supported 
by the Air Force through Phillips Laboratory/ 
Geophysics Directorate under contracts FI 9628-90- 
K-0002 and F 1 9628-93-K-0007 and NAS A grant 
NAGW-1665. 

REFERENCES 

Cummnock, J.A., R.A. Heelis, and M.R. Hairston, 
Response of the ionospheric convection pattern to a 
rotation of the interplanetary magnetic field on 
January 14, 1988,7. Geophys. Res.. 97. 19449-19460, 
1992. 



Hairston, M.R., and R. A. Heelis, High-latitude electric 
field studies using DMSP data, Tech. Rep., PL-TR- 
93-2036, Phillips Lab./Geophy. Directorate, Hanscom 

AFB, MA, 1993. 

Heelis, R. A., The effects of interplanetary magnetic field 
orientation on dayside high-latitude ionospheric 
convection. J. Geophys. Res., 89,2873-2880, 1984. 

Heelis, R.A., and MLR. Hairston, Studies of ionospheric 
dynamics utilizing data from DMSP, Tech. Rep,, GL- 
TR-90-0047(I), Air Force Geophys. Lab., Hanscom 

AFB, MA, 1990. 

Heppner, I.P.. and N.C. Maynard, Empirical high- 
latitude electric field models, J. Geophys. Res. , 92, 

4467-4489, 1987. 


Visualization Techniques in Space and Atmospheric Sciences 

Reiff, P.H., and J.L. Burch, IMF B v -dependent plasma 
flow and birkeland currents in the dayside magneto- 
sphere, 2., A global model for northward and south- 
ward IMF, /. Geophys. Res. ,90, 1595-1609, 1985. 

Rich, F .J., and M. Hairston, Large-scale convection 
patterns observed by DMSP, J. Geophys. Res., in 
press, 1993. 

Tufte, E.R., Envisioning Information, Graphic Press, 
Chesire Connecticut, 1990. 

Weinstein, A., A Valiant Little Terminal: A VLT User’s 
Manual, 121 pp., Stanford Linear Accelerator 
document SLAC-370 (Rev. #3), July 1991. 

Wilhelmson, R., H. Brooks, B. Jewett, C. Shaw, L. 
Wicker, “Study of a Numerically Modeled Severe 
Storm,” video by the National Center for 
Supercomputing Applications, 1989. 


51 









D.M. KLUMPAR, M.V. LAPOLLA*, B. HORBLIT* 

Lockheed Palo Alto Research Laboratories, 

Palo Alto, CA 

A prototype system has been developed to aid the 
experimental space scientist in the display and analy- 
sis ofspaceborne data acquired from direct measure- 
ment sensors in orbit. We explored the implementation 
of a rule-based environment for semi-automatic gen- 
eration of visualizations that assist the domain scien- 
tist in exploring one’s data. The goal has been to 
enable rapid generation of visualizations which en- 
hance the scientist’s ability to thoroughly mine his 
data. Transferring the task of visualization generation 
from the human programmer to the computer produced 
a rapid prototyping environment for visualizations. 
The visualization and analysis environment has been 
tested against a set of data obtained from the Hot 
Plasma Composition Experiment on the AMPTE/CCE 
satellite creating new visualizations which provided 
new insight into the data. 


* Now at Meta Software, Inc., Campbell CA 


1. INTRODUCTION 

The use of increasingly sophisticated 
spaceborne instrumentation in space plasma physics 
generally has not been accompanied by application 
of increasingly sophisticated techniques for analyz- 
ing, assimilating, and displaying the observations. 
The task of creating effective visualizations of 
scientific data has, in the past, been time consuming 
and difficult. It has required significant program- 
ming and graphical design expertise and creativity, 
and results in visualizations with little flexibility. 

As data rates increase new techniques need to be 
put into practice to facilitate more rapid data 
analysis and interpretation. Advanced visualization 
techniques can manage the magnitude and complex- 
ity of the massive data sets generated by new 
instrumentation. We developed a prototype analy- 
sis and display environment to reduce the develop- 
ment time and increase the flexibility of the visual- 
ization process. A key requirement in the design of 
the system is that scientists have the capability to 
quickly change the way the data is viewed to permit 
different lines of inquiry . 

One of the empirical space scientist’ s most 
challenging tasks is to unravel the effects of a large 
number of simultaneously acting physical processes 
using limited data gathered from space-based 
sensors. The scientist has a data set of limited 
scope from which to decipher cause-and-effect 
relationships and extract governing physical prin- 
ciples. Through visualizations the scientist creates 
pictures that facilitate his ability to explore the 
interrelationships in the data. By facilitating rapid 
generation of diverse visualizations, the scientists 
capability to uncover hidden relationships is greatly 
enhanced. 
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For most practicing instrumentalists, the 
process by which their one-of-a-kind pictorial 
representations are conceived and developed has 
not substantially changed in the last quarter century. 
The graphical design process consists of converting 
a scientific objective, observation, or concept into 
specialized software which produces the needed 
graphics. The process for creating the visualization 
software is labor intensive entailing man-weeks to 
man-months of collaboration between the program- 
mer and the scientist. This enormous effort pro- 
duces a single, inflexible visualization that often 
will be used to display the entire mission data set. 
Furthermore, these visualizations are produced one 
frame at a time generally without the benefits of 
animation. Typically, the space scientist looks at 
printed hard copies of each two-dimensional 
visualization to search for relationships. Such 
inflexible visualizations, combined with the labor 
intensive method used to create them, severely limit 
the scientist’s ability to truly explore and analyze 
the data. 

The Oakwood system presented here, addresses 
these issues by combining a cooperative computer- 
aided design (CC AD) [Friedelletal., 1991] ap- 
proach with a domain specific visualization rule 
base to aid a scientist in generating visualization 
programs. The Oakwood system is conceptually 
based on the Lockheed Integrated Visualization 
Environment (LIVE), a cooperative, graphical rule- 
based environment for designing software, process, 
algorithm, and application visualizations [LaPolla 
1992, LaPollaet al., 1993]. Oakwood is a visual- 
ization designer that utilizes commercially available 
analysis and visualization software packages as its 
foundation. The prototype system was imple- 
mented over Advanced Visual System’ s AV S 
visualization package. The Oakwood system 
allows a space scientist to easily express his analy- 
sis in terms of mathematics customary to the field. 
From this expression, Oakwood creates the appro- 
priate visualization program, as well as the appro- 
priate mathematical treatment. The visualization 
and treatment are then applied to space science data 
sets, facilitating analysis and producing three- 
dimensional color pictures. 

In this paper, we describe the prototype devel- 
opment system and briefly illustrate its functional- 
ity. Its application to the space sciences domain has 


been demonstrated by applying it to data obtained 
from the Hot Plasma Composition Experiment 
(HPCE) on the Charge Composition Explorer 
(CCE) satellite of the Active Magnetospheric 
Particle Tracer Experiment mission (AMPTE). In 
the following sections, we describe the specific 
space science domain problem (section 2), the 
prototype visualization system (section 3), and its 
application to the AMPTE data (section 4). We 
conclude with discussions of related work and 
describe the future direction of our work. 

2. SPECIFIC SPACE SCIENCE DOMAIN 
SETTING 

While Oakwood is adaptable to the analysis and 
display needs of many diverse disciplines, we have 
emphasized its initial application to the space 
sciences domain. More specifically, we have 
directed our attention to the arena where direct in- 
s' it u measurements of plasmas, fields, and corpus- 
cular radiation are performed aboard spacecraft. 
This domain has special requirements and needs 
distinct from the broader space sciences domain 
where remote imaging is the primary tool. In 
comparison with the numerous efforts to advance 
the state of analysis and display for image process- 
ing, relatively little effort has been devoted to 
general processing and analysis tools for non- 
imaging space investigations characterized by direct 
measurement. 

In imaging, one collects photons (or other 
quanta) arriving from a distant object to form a 
physical image. Spectral differentiation is often 
utilized to bring out specific physical characteristics 
of the scene and, thus, multi-spectral images are 
often collected. The primary characteristic of such 
images is that physical space is represented on the 
image plane. In contrast, in the direct measure- 
ments arena, the visualization and analysis param- 
eter space is typically not described by physical 
dimensions, but rather by quantities which represent 
physical characteristics of the medium. Thus, direct 
measurement space-science data resides in an 
abstract space, rather than on a plane characterized 
by physical dimensions on an object. For example, 
a suite of direct measurement instruments might 
measure the D.C. magnetic field (a three-compo- 
nent field vector), the D.C. electric field (also a 
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three-component field vector), the flux of particles 
as a function of their energy of arrival, charge, and 
mass composition, and the A.C. electric and mag- 
netic wave field spectra (energy density versus 
frequency). Each parameter is measured as a 
function of time and space (as the satellite moves 
along its trajectory) and all parameters vary in time 
and space with rates-of-change that are highly 
variable as the environment responds to externally 
applied forces, and as the spacecraft moves between 
different plasma regimes. The task of the analyst is 
to decipher the cause-and-effect relationships among 
the variations of the observed parameters and to 
determine the physical processes which govern the 
observed behavior. The directly measured param- 
eters may be compared against one another using 
visualizations, such as line plots, scatter plots, and 
surface representations, or may be used to produce 
false images, where the orthogonal axes correspond 
to various parameters. The measurements may also 
be transformed into higher order quantities such as 
moments of the distribution function or phase space 
density. Other transformations utilize combinations 
of data from multiple sensors giving secondary 
parameters that represent global characteristics of 
the plasma (e.g. plasma beta, ratio of plasma fre- 
quency to gyrofrequency, etc.). The direct and the 
derived quantities are often analyzed in concert with 
predictions from theory. Thus the direct measure- 
ments area is a distinctly different enterprise from 
imaging and requires special analysis and visualiza- 
tion tools distinctly different from those employed in 
image processing. 

3. THE PILOT SYSTEM 

To use Oakwood, a space scientist first deter- 
mines the type of analysis he wishes to perform. 
This analysis may be very specific, such as transfor- 
mation of particle flux to phase space density. Or, 
depending upon the analysis in mind, he may use 
more general analysis techniques, such as Fourier 
transforms. The scientist enters his analysis into 
the system at the space science level via a menu of 
mathematical functions and formulae. Since the 
main goal is an empirical scientific analysis of large 
quantities of data, Oakwood’ s graphical user 
interface is oriented towards mathematics and data 
manipulation. The analysis choices are used to 
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create a calculation which is a combination of AV S 
modules, PV-Wave, and C++ functions. Those 
same choices are used as constraints by the rule- 
base visualization generation component of 
Oakwood. Oakwood also creates a custom graphi- 
cal user interface for the interactive manipulation of 
the data and its visualization. 

Each mathematical function in Oakwood i s 
associated with a type of visualization rule. This 
association is used to constrain the visualization 
planner’s choices. The Oakwood planner fires only 
those rules which satisfy these constraints. Each 
rule in turn creates further constraints that must be 
satisfied. The former constraints we will call 
analysis constraints. The latter constraints, those 
generated by the visualization rules, we will call 
derived constraints. Each visualization rule is 
responsible for functionality in the underlying 
commercial packages. For example, AVS does not 
support animation. However, animation may be 
achieved by displaying successive slices of a 
volume. Animation may also be achieved by 
reading in successive data files and recording the 
graphical output. Thus, the animation can be 
achieved in several different ways using the AVS 
package. The animation rules know this and choose 
a particular method to achieve animation. Derived 
constraints are syntactic constraints based on data 
type output by a given module in the commercial 
packages. In the above example, depending on the 
type of sheer chosen, the system may generate 
images or polygons, thus requiring a different 
choice of AVS modules later in the creation of the 
visualization. 

Once the scientist has chosen an analysis, he 
may modify the visualization con struction by 
directly manipulating the visualization rules 
through the graphical level as shown in the large 
window (lower right window) of Figure 1 . The 
graphical rules are divided into three components, 
following the Cell Model [LaPolla, 1 992; LaPolla 
etal., 1993] as displayed in the window. The first 
is the data description component at the left side of 
the window. The data component models the 
dimension, complexity and type of data input into 
the system. The second is the data manipulation 
component. This component is responsible for 
manipulating the data using mathematical tech- 
niques and formulae, such as Fourier transform or 
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matrix inversion. The third component, shown at 
the right center of the window, is the graphical or 
data presentation component. This component is 
responsible for how the data is to be presented to 
the user, whether as animation, as a solid, as a 
vector plot, etc. At the space science level the user 
manipulates concepts from the space science 
domain. At the graphical level, the scientist ma- 
nipulates concepts more closely associated with 
data and graphics. The choices the user has at the 
graphical level are conditioned by the choices made 
at the space science level. 

The final choice a scientist may make concerns 
how the rules are applied by the visualization 
planning engine (See Figure 1 , experiment scope 
panel of the main window). A scientist may choose 
six different visualization construction methods: 
exhaustive, first, biggest, smallest, most connected, 
and least connected. Each of these methods must 


satisfy all analysis constraints, as well as all derived 
constraints. The exhaustive method means that 
Oak wood tries to create all variations of a 
visualization allowed by the chosen constraints. 
After all the visualizations have been created, the 
scientist may then pick the ones he wishes to use. 
The first method means that Oakwood creates the 
first visualization it can. The biggest method 
creates a single visualization, which uses as many 
functional modules as possible. The smallest 
method creates visualizations with as few 
functional modules as possible. The most 
connected and least connected methods are AVS 
specific. Basically, they are similar to biggest and 
smallest. The analyst may also bound his 
computation. That is, he can determine how much 
of the resources to apply to the creation of 
visualization and data analyses (See Figure 1 , 
experiment bounds panel at lower right). 
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Figure 1. An example screen including windows for( 1 ) the Graphical user interface ( lower right); (2) selection of domain level or graphical 
level (lower left); (3) data experiment window (upper left); and (4) display image (upper right). The graphical level window is used to 
describe the data characteristics, manipulations, and to control the generation of images and their attributes as described in the text. 
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It is also possible to manipulate the visualiza- 
tion after Oakwood has created it. As Oakwood 
creates visualizations, it also creates graphical user 
interfaces for manipulating them. Since the 
Oakwood prototype uses A VS as its main driving 
package, it relies heavily on the AVS interface 
widgets such as “Vector Scale,” “slice plane,” “N 
Segment,” or the choice of axis or interpolation 
methods. This is illustrated in Figure 2. 

Once the visualization program and its interface 
have been created, the user selects data to be read in 
and Oakwood, through the commercial packages, 
produces the visualization (Figure 1). The upper 



Figure 2. An example of the interface window for manipulating and 
modifying visualizations after creation of the prototype image. 


window of Figure 1 displays one spin of the 
AMPTE spectral data. 

4. APPLICATION TO THE AMPTE DATA 

The main strength of the Oakwood system is its 
ability to help the space scientist not only design 
and implement visualizations, but also design and 
implement programs to manipulate data. This 
section gives an example of the use of Oakwood 
and compares the current methods used by space 
scientists to the Oakwood methodology. The fact 
that the visualizations produced by the Oakwood 
system are sometimes simple and straightforward is 
unimportant. What is important is Oakwood' s 
time-saving methodology and the freedom this 
imparts to the scientist. The rapid generation of 
diverse types of visualizations gives the scientist 
new freedom to roam through the data and to 
potentially uncover subtle but important 
interrelationships. 

4.1 Data description 

A subset of observations from the Hot Plasma 
Composition Experiment (HPCE) on the AMPTE/ 
CCE spacecraft was chosen as a test application for 
the system. The Charge Composition Explorer 
(CCE) satellite of the AMPTE program was 
launched in August, 1 984 into a near equatorial 
orbit (inclination 4.8° with apogee of 8.8 R e and 
perigee 1 108 km). The CCE was spin-stabilized at 
1 0 rpm with its spin axis pointing approximately 
toward the sun. The spacecraft carried instrumenta- 
tion to measure composition and charge state of 
ions over a very broad energy range, electrons, 
plasma waves, and the geomagnetic field. The data 
used here, from the HPCE, was taken by a set of 
eight magnetic electron spectrometers that measure 
the flux of electrons from 50 eV to 25 keV [Shelley 
etal., 1985]. Each spectrometer is operated at a 
fixed energy with an energy resolution of 50%. 

The eight instruments are co-aligned with their 
fields-of-view perpendicular to the spacecraft spin 
axis and are operated simultaneously, making 
measurements in unison every 155 msec. Thus, an 
eight point electron energy spectrum is obtained 
every 9.5° of satellite rotation. The spectrometers 
are collimated with a 5° full width conical field-of- 
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view giving an effective full width field-of-view of 
5° x 1 4.5° during each measurement period. As the 
spacecraft rotates, the view directions sweep 
through a range of pitch angles the amplitude of 
which depends on the direction of B with respect to 
the spacecraft spin axis. The full pitch angle scan 
(0°-180°) is achieved when B is perpendicular to 
the spin axis. 

The output of the instrument may be viewed as 
a succession of time slices through energy space. 
Each time slice contains eight measures of the 
instantaneous electron flux (at the eight energies) 
and each successive slice is displaced by 9.5° of 
rotation in the spin plane of the satellite. 

4.2 Current Visualization and Data Analysis 

Methods 

It is customary practice to create specialized 
visualization templates, such as that shown in 
Figure 2 of Shelley et al. [ 1 985], to display the 
essential data from an instrument. Several such 
unique visualizations are typically created to 
display the data for each instrument. The entire 
data run for the experiment is then typically dis- 


played on these visualizations, creating thousands 
of frames of data which are stored on film, micro- 
film, or as hard copies. Selection of data for further 
analysis is carried out by visually scanning the hard 
copies or films to look for repeatable trends or 
interesting or unexplained signatures . Once event 
selection has taken place the analyst usually returns 
to the digital data for further analysis. This stage of 
investigation will involve further manipulation of 
the data and often results in the generation of 
additional specialized visualizations, which serve to 
display various characteristics of the events under 
analysis. Again the process is time consuming and 
labor intensive. 

4.3 New Visualization 

Figure 3 shows a visualization of the HPCE 
data described above. This visualization was 
created through PV-Wave and illustrates a simple, 
but effective enhancement to the presentation of 
particle spectrographic data. The basic display may 
be viewed as an energy vs. time spectrogram, a 
standard visualization in common use in space 
plasma physics. Since each time slice of eight-point 
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Figure 3. An example prototype visualization in energy-pitch angle-time space combines all three interrelated parameters of data 
sets commonly used in experimental space plasma physics (see text). 


Chapter II: Space and Upper Atmospheric Science Applications 


57 


flux measurements represents a different pitch angle as 
the spacecraft rotates, co-display of the instantaneous 
pitch angle is necessary for full comprehension of the 
measurements. This essential parameter is incorpo- 
rated into the plot of Figure 3 by allowing the instanta- 
neous pitch angle to control the height of the three- 
dimensional surface. The corrugated sheet thus 
produced contains, on one panel, all of the relevant 
parameters in an easily assimilable representation. 

This new view replaces two separate pitch angle and 
spectrogram frames of the traditional display. With 
this visualization, the scientist can interactively choose 
to see one or several spins on the same visualization. 
Using the Oakwood system, the scientist can also see 
the same data as a traditional spectrogram and view 
multiple spins as animation. The power that Oakwood 
offers the scientist is the ability to quickly create and 
manipulate such flexible visualizations. 

5. RELATED WORK 

Oakwood is a hierarchical, cooperative design 
system for creating data experiments. Oakwood 
utilizes commercial packages and user provided 
code fragments as its mathematical and graphical 
backbone. This allows the user to quickly design 
and create mathematics programs and visualizations. 
Visualization production can be decomposed into 
articulation and design [Friedell et al., 1992, 1993; 
LaPolla et al., 1993]. Design generates an object 
conceptualization, which is an amalgam of primitive 
graphical sub-objects that are subject to geometric 
and topological relations (or constraints). The sub- 
objects, in this case, are functions in a commercial 
package organized using C++ classes and visualiza- 
tion rules. Articulation is the activity of providing a 
precise graphical description of an object conceptu- 
alization [Friedell et al., 1992, 1993], The above is 
also true of data experiment production. Data 
experiment production can also be decomposed into 
articulation and design. For data experiment design 
in the Oakwood system, the object conceptualization 
and design discovery process is characterized as an 
amalgam of commercial package functions, a data 
characterization model, and design (search) strate- 
gies. Because of the multi-layered nature of 
Oakwood, there are not only geometric and topologi- 
cal constraints in the graphical model, and composi- 
tion constraints in the mathematical model, but also 


high-level contextual constraints imposed on the 
system by the application. 

Various systems have been developed to 
support the articulation task for visualization in a 
general way, e.g., AVS , apE, and PV-Wave. The 
Visualization Design Environment, a component of 
LIVE, supports both articulation and design, but 
only for visualization generation. 

Other systems support the articulation task for 
mathematics, e.g., MathMatica, PV-Wave. Few 
systems support the interaction between distinct 
graphical and mathematics packages, e.g., [Friedell et 
al., 1992, 1993]. Oakwood, however, supports both 
articulation and design for visualizations and 
mathematical programs. Because of Oakwood’ s open 
object-oriented architecture, design and articulation for 
other domains could also be supported. 

We are aware of two related systems under 
development to provide analysis and visualization 
framework with application to the space sciences 
domains. LinkWinds, Linked Windows Interactive 
Data System, under development at Jet Propulsion 
Laboratory, has similar objectives to Oakwood 
[Berkin, private communication, 1993; Jacobson and 
Berkin, 1993] in that it is intended to provide a user- 
friendly environment to support rapid prototyping and 
execution of visualization applications. It differs 
substantially in approach, however, utilizing linkage of 
visuals through a graphical spreadsheet metaphor. Its 
scope extends beyond our plans for Oakwood by 
providing a multi-user environment for conducting 
cooperative research among remote sites. A second 
analysis and visualization tool known as S AV S is 
being developed jointly by Science Applications 
International Corporation (S AIC), Advanced Visual 
Systems, Inc. (AV S), and the University of Mary land. 
S AVS is a combined data acquisition, manipulation, 
analysis, and visualization system for application to 
solar-terrestrial and planetary sciences [Szuszczewicz 
etal., 1992], Like Oakwood, this system is wrapped 
around the AVS visualization system, which provides 
a variety of tools forrendering volume data. 

6. FUTURE DIRECTIONS 

Several new NASA space flight missions, 
currently nearing completion of hardware develop- 
ment and scheduled to be launched in 1994, will 
carry new generation imaging particle spectrom- 
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eters that are expected to produce more comprehen- 
sive data on the plasma environment than any past 
missions. The Polar and Wind spacecraft of the 
GGS program and the Fast Auroral Snapshot 
Explorer of the Small Explorer program will each 
carry a comprehensive set of instruments to mea- 
sure most of the important parameters governing 
the eiectrodynamic behavior of space plasmas and 
fields. The ultimate scientific success of these 
missions will depend upon the ability of the investi- 
gator teams to carry out comprehensive analysis of 
the data from the entire suite of instruments. It is 
missions like these that we have in mind as we 
develop the analysis and display tools described in 
this paper. 

7. SUMMARY 

W e have described a prototype data analysis 
and visualization system that facilitates the rapid 
generation of visualizations with minimal user 
intervention. We have applied the system to the 
analysis and display of multi-parameter data 
collected by spaceborne instrumentation that 
measures the particle, plasma, and electromagnetic 
field environments through which satellites 
traverse. The system applies a rule-based approach 
implemented under the cooperative computer aided 
design paradigm to semi-automatically generate 
visualizations of a specified data set with minimal 
user intervention and guidance. When used under 
the control of a domain scientist, the system pro- 
vides an environment for rapid creation of meaning- 
ful prototype visualizations. The flexibility to 
quickly explore potential interrelationships among 
one’ s data greatly enhances the scientist’ s ability to 
thoroughly explore his data. 
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A series of prog rams for the visualization and analysis of 
space physics data has been developed at UCLA. In the 
course of those developments, a number of lessons have 
been learned regarding data management and data 
archiving, as well as data analysis. The issues now 
facing those wishing to develop such software, as well as 
the lessons learned, are reviewed. Modern media have 
eased many of the earlier problems of the physical 
volume required to store data, the speed of access and the 
permanence of the records. However, the ultimate 
longevity of these media is still a question of debate. 
Finally, while software development has become easier, 
cost is still a limiting factor in developing visualization 
and analysis software. 


1. INTRODUCTION 

A typical space physics investigation produces 
several gigabytes of information annually. The goal of 
the investigator who receives these data is to obtain the 
maximum amount of scientific understanding within a 
limited budget and a limited time frame. In general, 
the available financial resources are insufficient to 
complete the scientific analysis in the available time. 
Thus, to ensure that the data can be eventually ana- 
lyzed, the original data must be archived, as well as the 
data products derived from the initial analysis . This 
archive should be permanent and secure since most 
data are unique and most likely will not be superseded 
in the foreseeable future. Y et at the same time, access 
must remain simple because the data might be needed 
at any time. 

The space physics group at UCLA has faced 
these problems for nearly 30 years in handling data 
obtained on the ATS-1 and 6, OGO 1-6, Apollo 15 
and 16, ISEE 1 and 2, Pioneer Venus Orbiter, and 
Galileo missions. In response to the continual 
problem of optimizing science return in the face of 
constrained budgets on these missions, we devel- 
oped a data management system involving standard 
data formats, utility programs, and analysis pro- 
grams that remained fixed from project to project to 
minimize the software development associated with 
each project. In 1 979, we began the development 
of interactive graphics software as a further aid to 
increased scientific productivity. Also, during the 
mid- 1980s the author served as chairman of the 
National Academy’s Committee on Data Manage- 
ment and Computation (CODMAC) and NAS A’ s 
Planetary Science Data Steering Group. We draw 
on the experience gained in this software effort 
and the deliberations of these two committees and 
their reports [Committee on Data Management 
and Computation, 1982; 1986; 1988] in writing 
this report. 



60 


Visualization Techniques in Space and Atmospheric Sciences 


It is not our purpose in this paper to describe 
the details of the software developed during this 
effort or to impress the reader with its capabilities. 
Rather, our purpose is to share our experience and 
lessons learned and assist future users and develop- 
ers of similar efforts in this and allied disciplines. 

2. DATA MANAGEMENT 

When one begins to analyze large amounts of 
data from disparate sources, the need for standard- 
ization becomes quickly obvious. If one is to 
minimize reprogramming, standard data formats are 
a must. At UCLA we use a very simple structure, 
referred to as a flat file system, in which the data 
are stored in a binary file with time running down 
the first column and the various measurements 
made at that time in successive columns of the data 
matrix. This format is, by nature, best suited for 
data that can be stored as time series, but of essen- 
tially any dimensionality otherwise. The descrip- 
tions of the contents of the binary file are contained 
in a separate ASCII “header” file describing each 
column and its units, the original source of the data 
in the file, the creation date, and any explanatory 
text the creator desires to keep with the data set. 
New data arriving at UCLA are converted to this 
system before use. Exported data are converted to 
other formats as needed by the user if the user 
cannot use our file system. 

Accompanying our standard file system are a 
set of utility programs that allow us to inspect and 
manipulate these files. In developing these utility 
programs, we have tried to make them both easy to 
use and flexible. By maximizing the use of a 
standard set of utility programs, we minimize the 
programming effort required for any new task, and 
we minimize software efforts by making maximum 
use of tested routines. 

It is important that all data be catalogued. 
Moreover, the location of all data should be re- 
corded, and metadata, the data that describe the 
contents of the files, should be solidly linked to the 
data products. Easy access to the catalog is as 
important as access to the data. Providing this 
access may require the development of additional 
software tools. 

Over the last decade and a half, we have 
generally found that commercial programs were 
poorly suited to our needs. In particular, time series 
manipulation and display were generally poorly 
handled while contouring and the display of maps 


or images were emphasized in these tools. Another 
problem with commercial software is that it is 
usually difficult to tailor such software to one’ s 
special requirements. Commercial software can 
also be expensive, but it is seldom as expensive as 
developing one’ s own special purpose software. 
Moreover, software development requires time, as 
well as money. Complex algorithms and the 
associated codes are not developed overnight. One 
solution to this problem is to collaborate with 
groups requiring or possessing similar codes, so 
that development costs are spread over several 
applications. Access to source code is usually 
essential, since no two scientific applications ever 
seem to have identical requirements . 

A serious problem in developing a comprehen- 
sive data management system for a scientific (or 
commercial) application is that the computing 
systems improve at a rate comparable to that at 
which the software is developed. Hence, once the 
software is completed, the computer system is no 
longer state of the art and a new computer and 
consequent software adaptations and developments 
are required in order to stay competitive. Further- 
more, operating systems and the windowing and 
display environment of workstations continue to 
evolve at a rapid rate. Over the last few years, our 
own group has evolved from graphic terminals to 
workstations running S UNVIEW, to OPENLOOK, 
and currently we are evaluating MOTIF. It is 
difficult but in this environment one must strive to 
be as device and operating system independent as 
possible. 

Finally, one important lesson we have learned 
and one to which the CODMAC reports came back 
repeatedly is that software development must take 
place with the ultimate user closely involved. 
Software developed without such close coupling 
usually takes a wrong turn. 

3. DATA ARCHIVING 

The need for attention to the archiving of space 
science data has been receiving increasing attention 
in recent years as the realization struck that access 
to space was becoming less frequent, and not more 
frequent, with time. New missions in general 
complement or supplement old missions, not 
duplicate them. It is no longer possible to obtain 
approval to repeat an old mission simply because 
instrumentation has improved. Thus, most data 
obtained in space are unique. A corollary of this 
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observation is that if data are lost through some 
catastrophe, they are unlikely to be replaced in the 
foreseeable future. 

Every group managing the sole or primary copy 
of a data set should establish an archive. Such an 
archive should have a searchable catalog, so that the 
contents of the archive can be determined readily. 
The data in the archive should be readily accessible. 
In the past, large data sets had to be stored in 
warehouses, but modern media have obviated the 
need for such warehouses. While the resources 
must be made available to copy the data, this 
copying cost is small compared to the cost of 
maintaining the warehouses. Care must be exer- 
cised in the choice of the new media, however. The 
lifetime of magnetic tapes of all types (of the order 
of 10 years) seems to be much less than that of 
optical disks and compact disks. Finally, we 
emphasize that with modern electronic networks, 
the site of an archive is becoming less important. It 
is possible for an archive to be distributed among 
several sites. For example, measurements of 
plasma temperature and density from a mission 
might be housed at one institution while the electric 
and magnetic field data are at another and the 
trajectory data somewhere else. 

UCLA maintains an archive of magnetic field 
measurements from a number of past missions. In 
maintaining that archive we have learned a number 
of important lessons. First of all, we learned that 
seven and nine track tape does not provide a 
permanent archive. We still have much data on this 
medium, but we are attempting to copy those data 
as quickly as possible. At the urging of the Na- 
tional Space Science Data Center (NSSDC) we 
initially attempted to use 12-inch Write Once Read 
Many (WORM) optical platters. We copied all our 
experiment data records and supplementary data 
records for both the ISEE and Pioneer Venus 
projects from tape to WORM. However, we found 
the platters difficult to write and the equipment 
difficult to maintain. It is now clearly obsolete 
technology. 

About two years ago, we purchased several 
Sony writers for erasable optical disks (EOD). Each 
have a data volume of about 600 MB. These were 
an immediate success among those analyzing and 
processing data. They are relatively inexpensive 
and are easy to use. Most recently we have begun 
to use the new 1 .3 GB EODs. We also use 
EXABYTE 8mm tapes for backing up hard disks 
and for transporting data and have had no problems 


in these applications. We are now beginning to 
write CD-ROMs. While the cost of CD-ROM 
media is less than that of an EOD ($25 versus $ 1 00 
for a 600 MB disk), the CD-ROM writer and 
software is much more expensive ($ 1 0,000 versus 
$3,000). On the other hand, if one only wishes to 
read CD-ROMs and not to write them, the cost is 
trivial. Moreover, jukeboxes allowing access to 
large numbers of CD-ROMs are in the offing. 

Thus, the CD-ROM may be the medium of choice 
for future large data libraries. Finally, in choosing 
media one should remember that the access time of 
the various media types vary and that access time 
could become a limiting factor in an application. 
Table 1 lists average seek times and maximum 
transfer rate for a number of current media types. 
As attractive as CD-ROMs are, they have several 
disadvantages, such as limited capacity and moder- 
ately slow transferrates. 


Table 1. Data Access Rates 


Medium 

Average 
Access Time 

Maximum 
T ransfer Rate 

Current magnetic disks 

~8 msec 

-10.0 MRytes/sec 

Current EODs (1.3 GB) 

-36 msec 

-1.6 MBytes/sec 

Optimum 12” WORM 

-180 msec 

-0.7 MBytes/sec 

Double speed CD-ROM 

-200 msec 

-0.3 MBytes/sec 

8mm "Exabyte” (5 GB) 

-1000 sec* 

-0.5 MBytes/sec 

4mm DAT (2 GB) 

-400 sec* 

-0.35 MBytes/sec 

6520 BPI 9-track tape (140 MB) 

-350 sec 

-0.2 MBytes/sec 


*Time to search through 50% of a tape based on vendor's published “Search 
Rate” of ~2.5 MBytes/s. 


The choice of media is not the only issue facing 
the archivist. Security, longevity, and documenta- 
tion are also critical issues. All irreplaceable data 
should be copied and stored in two physically well- 
separated sites as a precaution against the loss of 
the archive due to some catastrophe, such as a fire 
or flood. Frequently handled data should be 
duplicated and one copy held in reserve. Documen- 
tation should not become separated from the data it 
documents. It is best that the documentation be put 
into machine readable form and stored on the same 
volume as the data. Furthermore, software used in 
creating data should be stored on the same volume 
as the data created. While software is not a perfect 
source of documentation, it is far better than having 
no documentation. 
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4. DATA ANALYSIS 

The ultimate purpose of managing a data set and 
maintaining an archive is to facilitate the analysis of 
the data. Our developments in this area have also 
taught us some important lessons. First, the entry into 
the program should be easy. At most, the user should 
have to remember the name of the program, but even 
that can be avoided with careful system design. 
Similarly, access to the data should be made trivial. A 
program should lead the user to the requisite files. 
Operations on the data should take seconds and not 
minutes in any interactive application. Otherwise, the 
creative process is interrupted. The program should 
communicate with the user in English and the instruc- 
tions to the program should be intuitive, so as not to 
require prior knowledge. The program must provide 
the functionality required by the user and it must be 
flexible. Finally, a screen is not the ultimate output 
device. A user needs to save the final results of the 
analytical session. Options for saving the results 
include paper, disk, tape, etc. 

While it is easy to write down the desired 
attributes of analysis software, it is much more 
difficult to achieve it because specialized software 
development is expensive and time consuming. 
Moreover, no agency wishes to pay for this soft- 
ware development. Standard grant sizes are too 
small to afford a full time code developer and 
computer science grants generally go to advanced 
development rather than to applications. Again, the 
only practical solution in most situations is to 
attempt to collaborate with analysts with similar 
needs or to adapt existing codes. 

Fortunately, many problems that we once faced in 
interactive graphics analysis have disappeared. The 
speed of machines is now no longer a problem in most 
applications . Memory sizes are usually quite suffi- 
cient. Data storage is now relatively inexpensive and 
data access times for most devices are not a limitation. 
Moreover, modern electronic networks have facilitated 
the exchange of both data and software. In fact, the 
various local, national and international networks have 
become essential to the conduct of space physics 
research at the present time. 

5. INTERACTIVE GRAPHICS ANALYSIS IN 

THE UCLA SPACE PHYSICS GROUP 

Our first developments in interactive graphics 
analy si s began in late 1 979 and 1980 when we ob- 
tained access to a Tektronix 4014 graphics terminal. 
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Our main computers in the 1970s and early 1980s were 
a series of HP minicomputers, an HP2 1MX, an HP 
FI 000 and an HP A900 in succession. The 4014 was 
too expensive for us to buy multiple units, but it was 
sufficient to develop software and techniques for 
interactive graphics analysis and it soon became 
oversubscribed by students and staff. Fortunately, HP 
soon developed microcomputers (HP 1 50) and termi- 
nals (HP2623) that had graphics and text screens (two 
windows) and that were inexpensive. We could then 
expand our service to multiple users although we had 
to be careful not to overload our minicomputer sys- 
tems. In the mid-1980s, we were able to purchase a 
microVAX with a VMS operating system and we 
converted our programs to run under VMS . At this 
point we were able to export our software to a number 
of different collaborating groups. Thus, early versions 
of our software are in the hands of various research 
groups, some modified for user-specific applications. 

In the late 1980s, we modified our software to run 
under UNIX on SUN workstations. We initially used 
the Template Graphics Package that was also available 
under VMS . We experimented with SUNVIEW, but 
as it became increasingly available and capable, we 
switched to using the OPENLOOK tool kit for X- 
Windows. We continue to upgrade the functionality 
of the software. The analysis software has been used 
in our analysis of data from the ISEE, Pioneer V enus, 
AMPTE, and Galileo missions. Some of the interac- 
tive graphics programs we have developed are de- 
scribed below. The series of programs all have names 
ending in ANAL, which stands for analysis. The first 
letter indicates the type of analysis. 

5.1 Banal 

This program was our first attempt at interactive 
graphics analysis and was designed to provide the tools 
needed to analyze microphysical processes (waves, 
discontinuities, boundaries) as observed in the vector 
magnetic field, B [Russell, 1983]. Thus there were 
tools to rotate, filter, Fourier analyze, and display 
magnetometer data. The program enables the calcula- 
tion of averages, principal axes, wave properties, and 
hodograms, as well as time series. 

5.2 Tanal 

When a more global analysis was needed, such 
as viewing the pile-up of the interplanetary mag- 
netic field in front of an obstacle, or the comparison 
of the field with a model over some volume of 
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space, it was helpful to include the trajectory (T) 
with the vector data and to display the field as 
vectors along the trajectory. This analysis was 
facilitated with the TANAL program, which also 
allowed plots versus altitude, as well as versus time, 
and in projection or planes. 

5.3 Manal 

The analysis of the state of the magnetosphere 
using ground-based magnetograms (M) has always 
been difficult because of the difficulty in accessing 
those records. During the International Magneto- 
spheric Study (IMS), a number of digital stations 
were established. We acquired those data, sorted 
them by time and developed MANAL to provide 
access to the records and their metadata. MANAL 
also enabled their display and manipulation. As 
machines became more powerful, it became fea- 
sible to include dynamic spectral analysis in the list 
of interactive techniques. This capability is now 
included in MANAL. Moreover, we can perform 
cross correlation analysis, such as dynamic coher- 
ence and dynamic phase differences. 

5.4 Uanal 

The Pioneer Venus project assembled a key 
parameter data set patterned after the Unified 
Abstract Data Sets (UADS) of the AE and DE 
projects. We kept this key parameter data set on 
line and developed the UANAL program to access, 
manipulate, and display it. UANAL can display 
time series or altitude profiles, can display data 
from any instrument, by itself, or overlaid on other 
data. The user can select scales and intervals. Data 
can be combined to create derived parameters and 
can be compared with models. Measurements can 
be made from the screen with a cursor. 

6. CONCLUSIONS 

Modem computers and their software 
environments provide almost unlimited capabilities 
for data analysis and visualization. Nevertheless, 
data management issues are still a concern, in 
particular, standardization and user involvement. 
Data archiving has been facilitated by modem 
media, but documentation, media permanence, and 


protection from catastrophes are still important 
issues. Finally, and very importantly, cost is still the 
limiting factor in software development. 

7. APPENDIX: THE UCLA FLAT FILE 

SYSTEM 

The flat file system is a two-dimensional data 
base system developed by the Space Physics group 
in the Institute of Geophysics and Planetary Physics 
at UCLA. The initial design of the system was 
developed by N.E. Cline in the early 1980s and was 
adopted for the analysis of the magnetometer data 
for both the International Sun Earth Explorer 
program and the Pioneer Venus Orbiter program. 
The initial code ran on the group’s HP minicomput- 
ers. C.R. Clauer, then at Stanford University, 
ported the code to the VAX/VMS environment. In 
doing this, he made several changes in the format, 
most notably the origin of the time word, so that his 
files and those created at UCLA are not inter- 
changeable. The Stanford system migrated to the 
Space Research Institute in Graz where extensive 
work took place on the TANAL analysis program 
and some on the BANAL program. The Hungarian 
PDS node also runs this version of the flat file 
system. Since the Graz group used IBM compatible 
PC computers extensively, it developed a PC-based 
flat file system and adapted some of the ANAL 
tools to the PC environment. 

Eventually, the UCLA Space Physics group 
was able to purchase a VAX/VMS system. Be- 
cause of their extensive investment in software and 
data sets in their original flat file format, the UCLA 
Space Physics group did not adopt the Stanford 
system, but ported their own standard format to the 
VAX/VMS. This version was exported to the JPL 
Space Physics group and enhanced by 
D. Winterhalter. In the late 1980s, UNIX became 
the operating system of choice. The Planetary Data 
System node at UCLA, the Planetary Plasma 
Interactions Node, decided to convert the flat file 
system to the UNIX environment. They did so, but 
in the process decided to increase the number of 
header files so that these flat files again were 
incompatible with the earlier set. The Space 
Physics Group, again, because of their large invest- 
ment in program and data sets, decided they could 
not accommodate this format change and remained 
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with their original format. The net result is that there 
are several flat file formats in use, all with common 
ancestry, but slightly incompatible. Fortunately, it is 
simple to convert from one to another. 

Flat files are useful for storing data which exist 
as multiple records or rows, each consisting of one 
or more data fields or columns. In space physics 
applications, they are most often used for time 
series of real numbers, but the data fields can 
contain most common data types such as: floating 
point, integer, character, and logical. A UCLA “flat 
file” actually consists of two files, an ASCII 
character header file and a binary data file. The 
ASCII header describes the associated data file with 
information, such as record length, number of rows, 
creation date, description of each data field or 
column, and user- written text. There are four parts 
to a header file. Lines 1 to 6 contain a physical 
description of the file (name, creation date, record 
length, number of rows and number of columns). 
The section from line 7 to the abstract line contains 
descriptions of each of the columns (column 
number, variable name, units, source, data type, 
location). The abstract consists of two parts, a 
fixed form, keyword section and a free-form user 
written section. The fixed form section contains 
information such as start and stop times, owner, 
flags, orbit number, averaging interval, etc. The 
binary data files have a fixed record length and can 
be accessed randomly. The user is not confined to 
sequential access. Since numerical data are stored 
in binary format, the record processing is much 
faster than in character data. The user can specify 
the number of data buffers to optimize disk access. 

One of the important assets of the flat file system 
is the existence of an extensive library of utility 
programs which act on the flat files. One of the most 
useful programs is the file lister and plotting program, 
FL, which can be used to inspect any flat file. This 
program allows you to do fast and sequential searches 
for a given time, to compute averages, maxima, 
minima and standard deviations, to apply masks to the 
data, to convert to ASCII format, to plot selected 
columns versus time or each other and to perform 
various otherfunctions. 
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The UCLA flat file system has been used exten- 
sively by the UCLA Space Physics group. However, 
the true mark of its utility lies in the scientific analysis 
that it has enabled. Almost all the analysis performed 
by this group and the resulting papers have relied on 
the existence of this software. Over 1 000 papers at 
meetings and in journals and books have been sup- 
ported by this software. 
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Over the years, the National Space Science Data Center 
has accumulated a rich archive of heliospheric, magneto- 
spheric, and ionospheric data, as well as data from most 
other NASA-involved science disciplines. To facilitate 
access to and use of these data, NSSDC has begun to put 
selected data onto CD-ROMs. This paper describes one 
such CD-ROM, and the access and display software 
developed at NSSDC to support its use. The data on the 
CD-ROM consist primarily of hourly solarwindmagnetic 
field and plasma data from many near-Earth spacecraft 
( OMNI) and deep space spacecraft (Voyagers, Pioneers, 
Helios, Pioneer Venus Orbiter). In addition, 5-minute 
resolution IMPS and ISEE-3 magnetic field and plasma 
data are also included. Data are stored in both ASCII and 
CDF formats. 
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1. INTRODUCTION 

The National Space Science Data Center 
(NSSDC) has collected data obtained by spacecraft 
in the Earth’s magnetosphere and interplanetary 
medium for over three decades. NSSDC has accu- 
mulated the largest collection worldwide of inter- 
planetary magnetic field (IMF), solar wind plasma 
(SWP), magnetospheric field, and plasma measure- 
ment data. These data sets are accessible on mag- 
netic tapes and partially ‘on-line’ via computer 
networks. Magnetic tapes take time to retrieve data 
and do not provide any visualization software. The 
‘on-line’ access is available for the scientific 
community in developed countries only. Thus, the 
existing data sets ought to be placed on modem 
media (e.g. CD-ROM) with ‘user-friendly’ software 
to be distributed worldwide. 

In 1992 NSSDC began to create a series of 
Space Physics CD-ROMs, starting with 
heliospheric IMF and SWP data sets from various 
spacecraft travelling through the solar system: 
several near-Earth spacecraft (OMNI), Helios 1 & 

2, V oy ager 1 & 2, Pioneer 1 0 & 1 1 , and Pioneer 
Venus Orbiter. The OMNI data base (one of the 
most widely used in the space physics community) 
contains hourly averaged IMF and SWP data near 
Earth for 1 963- 1 99 1 . In addition, 5-minute average 
data from near-Earth spacecraft IMP-8 and ISEE-3 
are included. The 5-minute IMP magnetic field data 
are from the interplanetary, magnetosheath, and 
magnetotail phases of the IMP orbit, while the 5- 
minute IMP plasma data are primarily interplan- 
etary with a small admixture of magnetosheath 
data. 

The collected heliospheric IMF and SWP data 
sets have been provided to NSSDC by Principal 
Investigators (Pis) of the corresponding instruments 
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onboard given spacecraft. The details of provided 
data sets are presented in [Couzens and King, 1986] 
and may be retrieved from the ‘on-line’ 

Coordinated Heliospheric Observations (COHO) 
directory in the NSSDC ANONYMOUS account 
(see below for NSSDC ordering and access 
information). These data sets have been created by 
Pis in different formats and time conventions, and a 
variety of coordinate systems have been used for 
data presentation and location of spacecraft. 
Therefore, we have made uniform the data formats 
and coordinate systems. Also we have transformed 
data as needed in order to provide data from 
different spacecraft in the same coordinate systems. 
Note in particular that two ASCII versions of the 
OMNI data are included on the CD-ROM, one the 
familiar native format (no position data, field and 
flow vectors in GSE coordinates) and the other in 
the standard format (Earth position data, field and 
flow vectors in RTN coordinates). 

The hourly resolution data files contain, for any 
given spacecraft, the heliocentric position of the 
spacecraft (or planet, for planetocentric orbiting 
spacecraft), the magnetic field intensity and Carte- 
sian components, and the solar wind plasma flow 
speed, density, temperature, and, for some space- 
craft, flow direction angles. The 5-minute resolu- 
tion files have basically the same parameters, 
except that geocentric spacecraft position is given. 
Further details on the parameters are given below 
and are included in on-disk documentation files. 

The CD-ROM and associated software described in 


this paper may be acquired by request of NSSDC at 
NASA Goddard Space Flight Center, Code 633.4, 
Greenbelt, MD, 2077 1 , or 301-286-6695 by phone, 
or REQUEST@NSSDCA.GSFC.NASA.GOV on 
Internet. 

2. DEFINITIONS OF COORDINATE 

SYSTEMS 

The NSSDC Heliospheric CD-ROM contains 
the trajectory positions of all spacecraft providing 
the measured data (Table 1). Trajectory positions of 
all spacecraft are given in Heliographic Inertial 
(HGI) Coordinate System. The HGI coordinates of 
the planets Earth and Venus have also been com- 
puted to provide a uniform disposition of the near- 
Earth and near- Venus spacecraft together with the 
deep space observations. The Geocentric Solar 
Ecliptic (GSE) and Venus Solar Orbital (VSO) 
Coordinate System have been used to locate the 
positions of corresponding spacecraft relative to 
Earth and Venus. Some of the following coordinate 
systems are described by Roy [1988]. 

Solar Ecliptic Coordinate System ( SE). The SE 
system is a heliocentric coordinate system with the 
Z axis normal to the ecliptic plane. The X axis goes 
toward the first point of Aries (Vernal Equinox, i.e. 
to the Sun from the Earth at the first day of Spring), 
and the Y axis completes the right handed set. 

Heliographic Inertial Coordinate System 
( HGI). The HGI system is a heliocentric coordinate 
system with the Z axis along the Sun’s spin axis 


Table 1. Spacecraft and Data Sets Coverage, Resolution, and Coordinate Systems 


Spacecraft 

Data Coverage 

Resolution 

IMF 

SW Plasma 

Location 

OMNI 

11/1963- 07/1991 

hour 

RTN & GSE 

RTN & GSE* 

HGI 

IMP-8 

10/1973-06/1991 

5-min 

GSE 

GSE* 

HGI & GSE 

ISEE-3 

06/1979- 12/1983 

5-min 

GSE 

none 

HGI & GSE 

Helios- 1 

12/1974- 12/1980 

hour 

RTN & SSE 

SSE* 

HGI 

Helios-2 

01/1976-03/1980 

hour 

RTN & SSE 

SSE* 

HGI 

Pioneer Venus 

12/1978 - 08/1988 

hour 

RTN & VSO 

VSO* 

HGI & VSO 

Pioneer- 10 

03/1972- 12/1988 

hour 

RTN 

no angles 

HGI 

Pioneer- 1 1 

04/1973-08/1992 

hour 

RTN 

no angles 

HGI 

Voyager- 1 

09/1977-08/1981 

hour 

RTN 

RTN 

HGI 

Voyager-2 

08/1977- 12/1990 

hour 

RTN 

no angles 

HGI 


* The SW plasma flow longitude angle is measured from -X (GSE, SSE or VSO) to -Y, i.e. positive for rotation of the 
flow in the same sense as the rotation of the Sun. 
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(positive to the North) and the X and Y axes in the 
solar equatorial plane. The X axis goes along the 
intersection line between the ecliptic and solar 
equatorial planes outward from the Sun in the 
general direction of winter solstice. The Y axis 
completes the right handed set. The ecliptic (SE) 
longitude of the X direction, i.e. the ascending node 
of the solar equator, was 74.37 degrees in 1900 and 
increases by 1 .4 degree/century. To further orient 
the HGI system, note that at the start of 1994 the 
Pioneer- 10 spacecraft moving down the tail of 
heliosphere was at zero degree HGI longitude. 

Geocentric Solar Ecliptic ( GSE) Coordinate 
System. The GSE system is a geocentric coordinate 
system with the Z axis northward from the ecliptic 
plane and the X and Y axes in the ecliptic plane. 
The X axis points toward the Sun from the Earth, 
the Y axis completes the right handed set. Often the 
GSE is used to refer to: (a) spacecraft centered 
system, which would be more appropriately called 
Spacecraft Solar Ecliptic (SSE), for a spacecraft 
that is near the ecliptic plane; and (b) other planet- 
centered system, e.g. Venus Solar Orbital (VSO) 
Coordinate System, where the X and Y axes are in 
the Venus orbit plane that is inclined about 3 
degrees from the ecliptic plane. 

RTN Coordinate System. The RTN system is 
centered at a planet or spacecraft. The R axis is radially 
away from the Sun. The T axis is the cross product of 
the Sun spin vector (North directed) and the R axis. 

The N axis completes the right handed set. Note that 
when the Earth is near its extreme heliolatitude 
excursion (e.g. +7.25 deg in early September) andits 
motion is parallel to the Sun’ s equatorial plane, the 
GSE and geocentric RTN systems line up, except that 
X = -R, Y = -T, and Z = N. 

3. ACCESS AND DISPLAY SOFTWARE 

Each data set has been created as a flat ASCII 
file with the logical record similar to the well- 
known OMNI data base. All parameters have been 
quality controlled (format, units, range, average 
values when possible, missing data, etc.), corrected 
(if necessary) and written in a similar format when 
possible (e.g. the SWP velocity values have format 
F6. 1 across all data sets). Missing values have been 
replaced by combination of digit ‘9’ in accordance 


with the format of given parameter (e.g. F6. 1 is 
filled by 9999.9). 

In addition to the ASCII versions of the files, 
CDF (Common Data Format) versions are also 
provided on the disk. These CDF files are for the 
convenience of users having CDF software. For 
further insight into CDF, s eeNSSDC CDF User’s 
Guide [1994] or call NSSDC’s CDF User Support 
Office (301-286-9884). The software described in 
the following paragraphs is MS-DOS based, and 
addresses only the ASCII versions of the data files. 

‘Menu driven’ and ‘user-friendly’ access and 
display software has been developed to review data 
coverage, to display data numerically and graphi- 
cally, and to create files (subsets of CD-ROM files) 
for migration to magnetic disk for further analysis. 
The Master Screen (Figure 1 ) provides a single 
selection of spacecraft, shows general documenta- 
tion, and allows a user to enter into the 
Intercomparison Software Package. This Package 
provides the ability to compare ‘stack-plot’ graph- 
ics of data from different spacecraft. The software 
will run in the ‘hourly’ or ‘5-minute’ modes accord- 
ing to the data set selected. 

The ‘hourly’ software works with any hourly 
resolution data set and offers the following options 
(see example on Figure 2): 

(a) display the Data Format Documentation 
File and provide a copy of this documenta- 
tion file to the user’ s output file; 

(b) display the Monthly Data Availability 
Information Table, i.e. the ‘up-to-one- 
month’ screen (Figure 3) shows how much 
data are available for a given spacecraft for 
that month (100% of available data is 
denoted as Y, 0% - N, the numbers from 0 
to 9 stand incrementally for each 1 0% of 
the data availability for a given day ); 

(c) show simultaneously on the same screen 
the Daily Data Availability Information 
Line (bottom panel on Figure 3) which 
describes the data availability (Y or N) for 
each UT hour of a given day; 

(d) display the data listing on the screen 
(Figure 4); 

(e) create an ASCII file of the selected param- 
eters and given time interval and display 
these parameters graphically; 
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(f) display data for an arbitrary number of 
days, from 1 day (24 hours) to 3 1 days 

(Figure 5); and 

(g) display a scatter plot, i.e. to show the 
dependence of one parameter on another 

(Figure 6). 

Note that the default plotting action is to plot 
time series about the mean value of the parameter 
for the interval, and to show the scale via the 
standard deviation measure to the left of the ordi- 
nate. Through the use of ‘L-Change level’ and ‘S- 
Change scale’ options in the display software, the 
user may fix the center line and the scale to meet 
his needs or taste. 

The ‘5-minute’ software provides similar 
options as the ‘hourly’ software. Some different 
display capabilities of this 5-minute mode are 
shown in Figure 7 (daily display) and Figure 8 
(crossing day boundaries). The Intercomparison 
Software Package shows the ‘spacecraft menu’ 
which provides the ability to select files of corre- 
sponding data (Figure 9) from multiple sources. 

The program merges these files and shows a ‘stack- 
plot' graphical display of selected parameters for 
comparison. There is an option that allows compari- 
son of the ‘hourly’ and ‘5-minute’ data on the same 
screen (Figure 10). 

The hardware/software environment needed for 
this application software is MS-DOS, 1 MB of 
usable memory, and any IBM-PC compatible 
monitor, although color monitors will have their 
color capability used by the software. Other sys- 
tems (UNIX, etc.) can access the ASCII files on the 
CD-ROM from their environments, although we 
provide no specific software support for such 


systems. Other systems with CDF software can 
access the CDF files on the CD-ROM; see earlier 
comments about CDF. 

4. CONCLUSION 

The ‘user-friendly’ approach and easy access to 
the interplanetary data, developed for the first 
NSSDC heliospheric CD-ROM, is consistent with 
the concept of providing a Personal Data Center on 
CD-ROM. The wide distribution and use of IBM- 
PC compatible computers and CD-ROM readers 
provide users easy access to the data previously 
stored in the World Data Center archives. This 
effort brings the World Data Center data bases to 
scientists in any corner of the world. 
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Figure I. Master Screen of the NSSDC Heliospheric CD-ROM. 
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Figure 2. An example of the 'hourly based' software menu. 
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Figure 3. Data Availability Information Table for 'hourly' data sets. 



Figure 4. Listing of hourly mean values of selected parameters. 
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Figure 5. The 'multi-day' presentation of ' hourly ’ mean data. 
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Figure 6. An example of the ' scatter plot' option. 
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Figure 7. An example of 'daily' variation of '5-minute' data. 
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Figure 8. An example of the ‘crossing day boundaries’ plots. 
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Figure 9. Intercomparison of data from different spacecraft. 
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Figure 10. Intercomparison of 'hourly' and ‘5-minute' data. 
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The research environment faced by space and atmo- 
spheric scientists in the 1990s is characterized by un- 
precedented volumes of new data, by ever-increasing 
repositories of unexploited mission files, and by the 
widespread use of empirical and large-scale computa- 
tional models needed for the synthesis of understanding 
across data sets and discipline boundaries. The effective 
analysis and interpretation of such massive amounts of 
information have become the subjects of legitimate con- 
cern. With SAVS (a Space and Atmospheric Visualiza- 
tion Science System ), we address these issues by creating 
a “push-button” software environment that mimics the 
logical scientific processes in data acquisition, reduc- 
tion, and analysis without requiring a detailed under- 
standing of the methods, networks, and modules that link 
the tools and effectively execute the functions. SAVS 
provides: (1) a customizable framework for accessing a 
powerful set of visualization tools based on the popular 
AVS visualization software with hooks to PV-Wave and 
access to Khoros modules, (2) a set of mathematical and 
statistical tools, (3) an extensible library of discipline- 
specific functions and models (e.g., MSIS, IRI, Feldstein 
Oval, IGRF, satellite tracking with CADRE-3, etc.), and 
(4) capabilities for local and remote data base access. 

The system treats scalar, vector, and image data, and 
runs on most common Unix workstations. We present a 
description of SAVS and its components, followed by 
several applications based on generic research interests 
in interplanetary and magneto spheric physics (IMP/ 
ISTP), active experiments in space (CRRES), and mis- 
sion planning focused on the Earth's thermospheric, 
ionospheric, and mesospheric domains (TIMED). 


1. INTRODUCTION 

1.1 An Integrated System 

SAVS represents an integrated system concept 
with a “push-button” approach to the access and 
implementation of tools necessary to : 

• search for, acquire, and read data; 

• establish satellite tracks, their relevant 
parameters, and their co-registrations with 
allied multi-spacecraft and ground-based 
diagnostics; 

• interactively interpret and analyze the 
findings; 

• compare the results with mature and/or 
developing models; and 

• creatively visualize the overall process and 
resulting scientific products . 

Much of the SAVS effort has been devoted to 
the development and implementation of a user- 
friendly architecture focused on ease and function- 
ality. Within our “push-button” approach, we have 
embedded a logical interactive data handling and 
analysis methodology, a mathematical toolbox, an 
interactive interpreter, and the AVS visualization 
software (Upson et al., 1989). As part of our effort, 
we are customizing AVS, its tools, and its operation 
for the NASA scientific user environment. Within 
this framework, we are creating custom functions, 
and integrating local and remote data base access, 
data translation, and formatting tools in order to 
provide scientific data manipulation, analysis, and 
display capabilities not available in the basic AVS 
system. 

1.2 Uniqueness of the Architecture audits End-to- 

End Approach 

The SAVS architecture recognizes that the 
scientist and mission design engineer work rou- 
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finely with data, theoretical models, and orbits 
(including orbital parameters, payload configura- 
tion, instrument fields-of-view, coordinated ground 
stations, etc.). At any given time they may work 
exclusively with models, data, or orbits, or combi- 
nations of all three. The most likely scenario 
involves an interactive combination of the three 
filtered through a mathematical interpreter and 
displayed through a broad set of visualization tools. 
This end-to-end approach of SAVS is illustrated in 
Figure 1 , where the products of the SAVS system 
serve a range of NAS A needs including: ( 1 ) data 
synthesis; (2) mission planning; (3) science and 
engineering trade studies; (4) data-model compari- 
sons; and (5) model test and validation. 

In mimicking the thought processes of the 
NASA scientist, the SAVS architecture has three 
entry concourses for daily activities: “Data,” 
“Models” and “Orbits.” By a “push of a button” in 
the SAVS front end scientists may enter any one of 
the three concourses and begin a procedure involv- 
ing S AV S-guided functionality, whereby they can 
exercise an unlimited number of loops in any given 
concourse, With this approach, the scientists can 
treat and overlay multiple model runs or data sets, 
and visualize the results in any number of formats. 


Visualization Techniques in Space and Atmospheric Sciences 

This SAVS concourse methodology and its generic 
products are illustrated in Figure 1 . 

The SAVS concept is designed for broad appeal 
and functionality since its architecture accommodates a 
general scientific approach to data access, 
manipulation, analysis, and visualization. Every 
scientific discipline deals with data, either in scalar, 
vector, or image format. Every discipline needs to 
develop, test, and apply empirical and first- principle 
model descriptions of cause-effect relationships. Every 
science needs to deal with the spatial and temporal 
coordinates of data sets (generally represented by a 
satellite’ s orbital elements, payload configuration, and 
instrument fields-of-view in a space science 
application); and every scientific discipline requires a 
suite of mathematical tools to handle data and prepare 
it for visualization. This is the generic system 
capability of SAVS, making it applicable to any 
scientific discipline. In the application described here 
SAVS is tailored to the needs of the space physics 
community with user interests in data from past 
programs (e.g., DE, IMP-8, ISEE, CRRES, etc.), in 
data provided by recent and imminent satellite 
operations (e.g., Wind, Polar, Geotail, etc.), and in 
effective planning of future missions (e.g. , TIMED, 
IMI, and Grand T our Cluster). 


SAVS: An End-to-End Approach with Interactive Functionality 



Figure 1. The SA VS entry concourses of “Data, " “Models, " and “Orbits " allow focus while providing flexibility for user-defined 
overlays of data, model, and experiment parameters in any number of visualization formats. The interactive functionality provides 
an end-to-end approach to enhanced scientific productivity. 
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1.3 The Visualization Software 

Underlying data handling and analysis in the 
SAVS system is the AVS visualization software. 
Designed for a distributed network environment, 
AVS supports visualization applications running on 
a single system or across a network of systems. It 
provides a complete image display capability, 
including real-time pan and zoom, rotation and 
transformation, flip-book animation, and support 
for 8-bit, 24-bit, and floating point images. Imaging 
filters include operations, such as contrast stretch- 
ing, pseudo-coloring, and histogram balancing, as 
well as data resizing operations, such as interpola- 
tion, cropping, and sampling. AVS provides tools 
for rendering volume data, a real-time isosurface 
generator, a unique transparent volume Tenderer 
which creates real-time semi-transparent images 
with full rotational and lighting control, and genera- 
tion of geometric objects such as arbitrary slicing 
surfaces, dot surfaces, and vector nets. The software 
is based on industry-standard graphics, windowing, 
and programming interfaces including X-Windows 
and PHIGS+. (For details contact P. Esdale at AVS, 
Inc., viapaule@avs.com.) 

With AVS as the pivotal visualization tool, 
SAVS and its users also benefit from the extensive 
AV S module library at the International AV S 
Center (IAC) at the North Carolina Supercomputer 
Center (see NCSC/IAC in references), which serves 
as a catalyst for increasing AV S product functional- 
ity. As a worldwide repository available at no cost 
to AVS/SAVS users, the center collects, sorts, and 
distributes user-contributed public-domain mod- 
ules. An example of existing aspects of this re- 
source is the availability of 229 Khoros-derived 
AVS modules (Myerson and Reid, 1993). Khoros, 
like AVS, is a data-flow-oriented application, but it 
makes available to AVS and SAVS its one-dimen- 
sional and two-dimensional signal processing 
algorithms, complementing the well-established 
three-dimensional volume visualization capabilities 
in AVS. 

2. THE “PUSH-BUTTON” FRONT END 

ARCHITECTURE 

The background plate in Figure 2 shows 
portions of the opening SAVS panels. The top- 


center of the main panel provides a “push-button” 
entry to any of the three SAVS concourses (Data, 
Models, and Orbits). The main panel also includes 
the display window (which is hidden by the front 
panel) where objects and object overlays are 
presented and where discrete controls can be used 
to manipulate the scene with definable increments 
of rotation, translation, magnification, and 
reduction. 

The pull-down menu bar (at the top of the main 
panel ... “controls,” “filters,” etc.) provides access 
to a wide spectrum of functions: everything from 
general application needs to math and visualization 
tools for multi-variant/multi-dimensional data sets. 

The applications panel (vertical panel on left) 
provides all parameter controls for each model run, 
for math tools used, and for the visualization 
process exercised. Behind these controls, and most 
SAVS buttons, are tailored networks transparent to 
the user, enabling him to easily assemble the 
necessary components for problem analysis without 
knowledge of the AVS-network building require- 
ments faced by non-SAV S users. 

The front panel in Figure 1 illustrates the 
staging process in a user’ s entry and exercise of the 
“Models” concourse. Selection of “Models” pre- 
sents a choice browser at the top-left of the display 
window which allows the user to search through 
and select any number of models, including: ( 1 ) 
those in the SAVS library, (2) those which are user- 
unique and implemented within SAVS by built-in 
hooks, or (3) those accessed through remote proce- 
dure calls. In the current SAVS library are: (1) the 
Feldstein auroral oval model (Holtzworth and 
Meng, 1975), (2) the International Geomagnetic 
Reference Field (IGRF, Peddie, 1982), (3) the 
International Reference Ionosphere (IRI, Rawer and 
Ramanamurty , 1 985), (4) the MSIS model represen- 
tation of thermospheric densities and composition 
(Hedin, 1 983), (5) the WIND model of thermo- 
spheric winds (Hedin, 1991), and (6) files of MHD 
model runs of the magnetosphere (Fedder and 
Lyon, 1987). 

Selecting WIND (as an example of a SAVS 
application scenario) presents the user with: (1) an 
automatic visualization default selection of input 
parameters and a corresponding default 
visualization format (see pull-down radio buttons 
under the “Models” concourse button in the front 
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Figure 2. The opening SAVS panel (background plate) gives the user access to the "Data, ” "Models, ” and “ Orbits ” concourses, 
along with push-button utilities for data and model handling and visualization techniques. The front plate is an example of a SAVS 
application scenario discussed in the text. 


panel of Figure 2), (2) the default object in the 
display window, and (3) the control panel on the 
left which allows the user to tune the model and its 
image according to all its driving variables. For 
WIND, the input variables include dial controls for 
universal time (UT), the activity index (Ap), and 
the 1 0.7 cm solar flux (F10.7). The control panel 
also offers keyboard controls for selection of 
computational resolution, altitude for a surface 
display of the thermospheric wind vectors, day-of- 
the-year. and scaling of N-S and E-W components 
of the velocity vectors. The user can pan, zoom, or 
rotate the image, exercise a parameter study by 


“instantaneously” changing the input conditions and 
observing the results, or begin to overlay data or 
other models (e.g., a color rendering of Cb and N 2 
densities from MSIS, the boundaries of the auroral 
oval according to Feldstein (with controls over UT 
and Kp), the IGRF to study meridionally- or 
zonally-coupled regions, or data for model- 
measurement comparisons). An illustration of 
multiple overlays will be taken up in the discussion 
ofFigure 4. 

In the case of model-measurement comparisons, 
SAVS has a specialized module that projects the 
results of any three-dimensional model onto any 
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one-dimensional line (Szuszczewicz, et al. , 1 993), a 
requirement for comparisons of along-track satellite 
observations with large-scale three-dimensional 
model predictions. Under the “Orbits” concourse, 
the user can call up any predefined mission orbit or 
one defined by the theoretical CADRE-3 satellite 
ephemeris package (Galperin, 1990) imbedded in 
the SAYS “Orbits” concourse. The user can then 
implement the three-dimensional to one-dimen- 
sional interpolation module by activating the 
“three-dimensional to one-dimensional” item in the 
“filters” submenu of the menu bar. 

3. DATA ACCESS 

The pivotal point of any analysis effort involves 
the data, with the SAVS architecture making 
available the tools to browse, access, and read data 
from local and remote repositories. While data 
being accessed by NASA researchers exist in an 
exceedingly wide range of formats and organiza- 
tional levels, HDF (hierarchical data format), CDF 
(common data format), and netCDF are developing 
broad appeal. HDF, for example, is the baseline 
standard for EOSDIS data product generation, 
archival activities, ingest, and distributions. And 
CDF has been adapted by ISTP for many of its data 
products. With this perspective, we are providing 
HDF, CDF and netCDF readers, while including a 
set of “hooks” so that users can include readers for 
“non-standard” data formats. 

To access data from national and international 
sites, we are integrating browsing and remote 
procedure calls into SAVS via adaptations of a 
menu-oriented data base front-end. Users, with 
widely varying applications, can therefore act as 
their own data retrieval service, thus relieving a 
longstanding bottleneck connected with poor user 
access to the data and dependence on data centers 
for satisfying their requests. While this element of 
S AV S is still under development, a prototype 
system has been implemented based on the 
relational data base management techniques 
employed in S AIC’s CenterView (Brennan, 1987). 
CenterView has a history of more than ten years of 
development efforts and is routinely used as the hub 
of a worldwide network which acquires, stores, and 
analyzes an active real-time global seismic data 
base. 


We illustrate the ease and utility of this SAVS 
element of remote data access, which we call 
SAVS-View, in an application that can be thought 
of as generic to interplanetary and magnetospheric 
physics. Working with the National Space Science 
Data Center (NSSDC) at the NASA Goddard Space 
Flight Center we set up a remote library of data and 
developed an index system and a corresponding set 
of “keys” which would allow a user to browse the 
index catalogs and retrieve the desired subset of 
data from the library. This browse-and-retrieve 
activity involved a number of successful tests and 
demonstrations, one of which employed an IBM 
RS6000 at the University of Colorado linked with 
an Oracle-based network server at NSSDC. 

The background panel in the upper plate of 
Figure 3 shows the set of keys that our “test user” 
activated in establishing a query of available data in 
our library. The panel in the foreground shows the 
resulting list of data which matched his index of 
“keys” after having launched the query. In this case, 
the user was interested in solar wind input to the 
magnetosphere and wanted to build an empirical 
picture of that energy using available data and the 
solar wind energy formalism represented by the 
epsilon parameter (Perrault and Akasofu , 1 97 8 ; 
Vasyliunas et al., 1 982) which has the form 
e = l2vB 2 Sin 4 (0/2). In this relationship vis the 
solar wind velocity, B is the value of the 
interplanetary magnetic field, 0 is the polar angle 
of the component of the IMF normal to the Sun- 
Earth line and measured from the northward 
geomagnetic axis, and 1 0 is a constant. In order of 
selection, the user struck the “data” key, and then 
the “subcategory” key which allowed his choice 
or choices of measurements. Here, he selected the 
total B-field, the solar wind velocity and density, 
and the B z -component of the IMF as an indicator 
for 0. His data source of interest was IMP-8, so he 
keyed “Missions” and selected IMP-8, and then 
launched the query using the pull-down query menu 
at the top of the panel. The entire effort, including 
the time for SAVS-View to find and list the 
available data, required approximately two minutes. 
The transfer of those files and the presentation of 
the B and B z parameters using the SAVS two- 
dimensional stack plot capabilities are presented in 
the lower plate in Figure 3. The formatting of the 
data plots took advantage of S AV S tools that 
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allowed the selection of any parameter in the file 
for the abscissa and the ordinate. SAVS also allows 
truncation of the individual plots, as well as their 
extensions along any user-selected domain of the 
independent variable. 

In the development of data access tools, current 
SAVS activities also include the implementation of 


a SAVS utility to access the Logical Library 
System (LLS) developed at the NASA Goddard 
Space Flight Center (Jacobs, 1993). The LLS, as the 
name suggests, has the functionality of a library, 
with programs, missions, events, etc. catalogued 
within the framework of library shelves (e.g., 
magnetospheric physics), books (e.g., the DE and 



Data Category: Measurem ents | 

Data Subcategory 

~ B-fietd !“ Solar Wind (Rfeahay Veldcay B 


E -field _ Solar Wind (Thermal) Velocity 

Ion Composition - Solar Wind Density 

Mission Name 

DE-1 DE-2 @ IMP-8 




Figure 3. The upper plate shows the set of "keys" that a test user activated in establishing a query of available data at a remote 
site. The bottom panel shows a subset of the data acquired, manipulated and displayed within the SA VS system. 
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IMP-8 missions), chapters (e.g., mission years or 
event categories), and pages (e.g., specific magnetic 
storms). This system takes advantage of the GSFC/ 
NSSDC Master Directory and will eventually make 
data at the center available online to S AVS calls. 

4. VISUALIZATION APPLICATIONS TO 

MISSION SCENARIOS 

We demonstrate the interactivity and extensibil- 
ity of the SAVS system with an application sce- 
nario that can be considered generic to any mission 
planning exercise or to an activity involving data 
reduction and analysis that requires: (1) a number 
of models, (2) ground-based and spaceborne data, 
(3) “in situ” and remote sensing techniques, (4) 
perspectives on fluxtube coupled domains, and (5) 
co-registration of satellite-borne and ground-based 
diagnostics. 

A strawman scenario for a given mission is 
presented in Figure 4. The scenario has counterparts 
in the CRRES, TIMED, or ISTP missions, requiring 
the coordination of satellite passes with ground- 
based optical, ionosonde, and radar diagnostics. The 
“Models” concourse was used in generating the 
object in the left panel in Figure 4 by overlaying: 

( 1 ) thermospheric winds (blue vectors) and oxygen 
densities (global color-coded surface) at 350 km 
using the SAVS library models WIND and MSIS, 

(2) the auroral oval boundaries from the Feldstein 


model, (3) altitude cut-planes of ionospheric 
densities from the IRI in the latitude and longitude 
planes intersecting the coordinates of the Arecibo 
Observatory, and (4) geomagnetic field lines (in 
white) from the IGRF. The satellite trajectory was 
generated in the “Orbits” concourse using the 
CADRE-3 package. 

With push-button and dial controls the user can: 
(1) adjust the auroral oval for any magnetic activity 
index or universal time, (2) change the heights of 
the thermospheric wind and density calculations, 

(3) vary the activity index Ap, the solar F10.7 flux, 
and the day-of-the year input drivers for WIND and 
MSIS, (4) alter the year for the IGRF, (5) tune the 
month and sunspot number for the IRI, and 
(6) change the coordinates for the intersecting cut- 
planes (with interest, for example, in high-latitude 
passes in conjunction with Sondrestrom radar 
operations in Greenland). The cut-planes could also 
be in magnetic E-W and N-S directions instead of 
being aligned with the geographic contours of 
latitude and longitude at the Arecibo Observatory. 

From the CADRE-3 orbit specifications the user 
can develop stack plots of any number of relevant 
parameters versus UT (or for example, versus 
latitude and longitude, or versus MET, or versus 
LT). Parameters available to the user include: (1) 
SZA, MLAT, MLONG, MLT, and L-shell value, (2) 
point of conjugacy, (3) spacecraft velocity vector 
relative to the ambient B-field, (4) height of the 



Figure 4. A visualization application of SA VS to a mission scenario with counterparts in the CRRES, TIMED, and ISTP missions. 
The left panel provides a global view, while the right panel focuses on the conjunction between the satellite pass and the radar 
field-of-view at the Arecibo Observatory. 
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terminator relative to the satellite track, (5) B -field 
values or values of its E-W and N-S components, 

etc. 

The user can also call-up the three-dimensional 
to one-dimensional module to interpolate any and 
all model results along any segment of the satellite 
track and through the “Data” concourse compare 
the results with local or remote data bases of along- 
track measurements. 

For co-registration with ground-based 
diagnostics, the right panel of Figure 4 presents a 
zoomed-in perspective on the Arecibo site that 
includes the projection of the ground-based HF 
radar beam up to an altitude of 400 km. That 
projection is developed and displayed using the 
S A V S “field-of-view” module that is controllable 
by the user to represent any and all ground-based 
remote sensing diagnostics. The user need only 
input the coordinates of the ground site, the azimuth 
and elevation of the instrument’ s line-of-sight, and 
its beam-width (or acceptance) half-angles in the 
N-S and E-W directions. 

Inspection of the right panel of Figure 4 shows 
that the satellite (the black sphere) at the given UT 
in its trajectory is not on a fieldline that connects to 
the region diagnosed by the ground-based radar. 
With animation and pause controls, the user can 
determine the exact time of fieldline-coupling and 
begin a number of S AVS-controlled operations. For 
example, the three-dimensional to one-dimensional 
module can be used to interpolate ionospheric and 
thermospheric model densities, temperatures, and 
composition onto the connecting fieldline, and the 
model data used as inputs exercised to calculate 
fluxtube-integrated conductivities from the satellite 
to the region covered by the ground-based diagnos- 
tics. The module could also be used to calculate 
integrated densities or emissions along the field-of- 
view of the ground-based sensor. The field-of-view 
module can also be “attached” to the spacecraft to 
project and visualize the field-of view of an 
onboard remote sensing device. In this case, the 
animation and visualization product could focus on 
the determination of the time of intersection of the 
fields-of-view of the satellite and ground-based 
sensors. The scenarios are unlimited. Models can be 
tuned and retuned, and results visualized in simple 
stack plots of along-track correlations for compari- 
sons with data. Visualizations can include constant 
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altitude surfaces of any geophysical parameters 
(e.g., N e , T e , and densities of 0+, N 2 +, ..., N n , T n , 
and densities of O, N 2 , ..., thermospheric wind 
vectors and their meridional and zonal components, 
etc.). Instead of constant altitude representations, 
the user can choose altitude cut-planes, isosurfaces, 
or isocontours. The proper visualization product is 
only determined by the perceptive and inquisitive 
scientific mind, with the freedom of the push-button 
operations of the SAVS system making the choices 
immediately available. 

5. COMMENTS AND CONCLUSIONS 

As described, the SAVS architecture accommo- 
dates a generic scientific approach to data access, 
analysis, and visualization with tools that provide 
online mathematical analysis (e.g., averaging, 
interpolating, spectral analyses through FFTs, etc.) 
along with one-, two-, and three-dimensional 
displays that can include animation. Data displays 
and model results can be rendered in an array of 
two-dimensional stack plots with the user exercis- 
ing control of abscissa and ordinate selection from 
the full set of parameters in the data file. The 
observations and the model results can be rendered 
in color codes, isolines, isosurfaces, cut-planes, and 
texture maps, with controlled lighting and transpar- 
ency. The three-dimensional to one-dimensional 
interpolation modules can be used for along-track 
model-measurement comparisons, field-line- 
integrated calculations, or intensity calculations 
along the line-of-sight of remote sensing devices. 
And the field-of-view module can be used to 
establish regions of co-registration involving multi- 
satellite missions and ground-based observatories. 

The current system has been developed and 
tested using a relatively large number of models 
covering magnetospheric, ionospheric, and thermo- 
spheric physics (see e.g., Szuszczewicz et al, 1 993 
and Szuszczewicz, 1 994). And tests involving the 
data readers and local and remote data access tools 
have included the CRRES, IMP-8, DE, and the 
ISEE missions. 

The push-button architecture is intended to 
service user needs focused on science, and not 
graphics network building and programming. The 
complexities of the tools are in the background as 
specialized and tailored modules within the S A V S 



Chapter II: Space and Upper Atmospheric Science Applications 

system framework. The “expert user,” however, can 
still access the foundations of AVS and build his or 
her own graphics network. 

While SAVS includes a library of models and 
data readers, it is also providing the hooks and 
menu driven recipes for user-unique formats and 
models in order to guarantee flexibility and extensi- 
bility in the overall system. The integrated system 
is generic in its capabilities with natural applica- 
tions to ISTP, TIMED, EOS, and TRMM, to name 
only a few of the near-term large-scale science 
programs at NASA. 
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The International Solar-Terrestrial Physics Program 
(ISTP) is a multi-spacecraft, multi-national program 
whose objective is to promote further understanding of 
the Earth’s complex plasma environment. Extensive 
data sharing and data analysis will be needed to ensure 
the success of the overall ISTP program. For this reason, 
there has been a special emphasis on data standards 
throughout ISTP. One of the key tools will be the Common 
Data Format ( CDF), developed, maintained, andevolved 
at the NSSDC, with the set of ISTP implementation 
guidelines specially designedfor space physics data sets 
by the Space Physics Data Facility ( associated with the 
NSSDC ). The ISTP guidelines were developed to facilitate 
searching, plotting, merging, and subsetting of data sets. 
We focus here on the plotting application. A prototype 
software package was developed to plot Key Parameter 
( KP) data from the ISTP program at the Science Planning 
and Operations Facility ( SPOF). The ISTP Key Parameter 
Visualization Tool is based on IDL and is keyed to the 
ISTP guidelines, reading data stored in CDF. With the 
combination of CDF, the ISTP guidelines, and the 
visualization software, we can look forward to easier 
and more effective data sharing and use among ISTP 
scientists. 


1. INTRODUCTION 

The International Solar-Terrestrial Physics 
Program (ISTP) is a multi-spacecraft, multi- 
national program whose objective is to promote 
further understanding of the Earth’ s complex 
plasma environment. The first ISTP spacecraft, the 
Japanese-U.S. (ISAS-NASA) Geotail spacecraft, Is 
now in orbit. Two NASA spacecraft, WIND and 
POLAR, are slated for 1994 launches. The fleet of 
spacecraft also includes several equatorial space- 
craft: (NOAA) GOES spacecraft and (DOE) LANL 
spacecraft. IMP-8 supplements these observations. 
Ground-based data from DARN, SESAME, CANO- 
PUS, and Sondestromfjord, and theoretical studies 
and modeling complement the spacecraft data. In 
addition, NASA is collaborating with the European 
Space Agency (ESA) in two additional missions: 
SOHO and Cluster. The Russian Space Research 
Institute (IKI) is participating through the Inter- 
Agency Consultative Group with the Interball 
mission. All of these missions together will study 
the generation, flow, and dissipation of mass, 
momentum, and energy between the Sun and the 
Earth. 

Extensive data sharing and data analysis will be 
needed to ensure the success of the overall ISTP 
program. For this reason, there has been a special 
emphasis on data standards throughout ISTP. One 
of the key tools will be the Common Data Format 
(CDF), with the set of ISTP implementation guide- 
lines specially developed for space physics data sets 
by the Space Physics Data Facility (associated with 
the NSSDC). CDF (along with the ISTP guidelines) 
is a means of storing data and metadata (description 
of the data) in a standard, readily accessible format. 
CDF, developed, maintained, and evolved at the 
National Space Science Data Center (NSSDC), is a 
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management tool for scientific data sets. Data are 
organized into an interlocking grid structure with 
standard labels, time tags, dependencies, uncertain- 
ties, and offsets explicitly defined for later search- 
ing, plotting, merging, and subsetting. CDF has a 
high-level programming and toolkit library which 
provides functionality without excessive program- 
ming and is portable to a wide variety of platforms. 
CDF data sets are transportable between any of the 
platforms that CDF supports (VAX, Sun, SGi, IBM, 
HP, and PC). CDF is freely available to the public, 
and will be supported at least throughout the 
lifetime of ISTP and the IACG campaigns. 

Many tools now exist or will be designed and 
implemented in the future for use with CDF. 
Additionally, there are specific tools designed and 
planned for CDF data sets which include the 
standard set of ISTP descriptions. All of these are 
shown in Figure 1 . This figure is divided into three 
sections: creation, visualization, and manipulation. 


The C and Fortran programming interface spans all 
of these. The ISTP Guidelines add a layer of 
specific metadata (attributes) to the CDF data sets; 
the guidelines are shown in blue next to the C and 
Fortran programming interface in Figure 1 . The 
tools in the three sections use either generic CDFs 
or CDFs with ISTP Guidelines. We briefly describe 
each of these tools. The CDF toolkit, which spans 
all three sections, is part of the generic CDF distri- 
bution and includes routines for assisting in the 
creation of CDFs through the use of skeleton tables 
(section 2), tools to list and browse the data values 
and metadata (visualization), and tools to edit and 
convert CDFs (manipulation). These tools work 
well with any CDF. CDF data sets can be created 
just using the C or Fortran programming interface, 
although the process is simplified by using the 
combination of skeleton tables (CDF toolkit) and 
programming. The creation process is made even 
easier through a prototype tool called Make_CDF, 
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Figure I. CDF associated tools and programs: white background indicates tools in the planning stage, light colored background 
indicates software development in progress, and somewhat darker background indicates prototype software. 
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which also uses the ISTP Guidelines (shown in 
yellow in Figure 1). With a data set in hand, a user 
describes the input and output format through a user 
interface, and the tool then creates the CDF data 
set. No programming is required. Other tools built 
on top of CDF and the ISTP Guidelines include a 
package which is being developed to validate 
whether or not a particular CDF meets the ISTP 
guidelines (i.e., whether or not all of the required 
metadata is included), a Merge and Subset package 
which is in the planning stage, and the ISTP Key 
Parameter Visualization Tool described below. This 
last tool presently plots time series key parameter 
data via the Interactive Data Language (IDL). There 
are other packages which make use of the CDF 
library, but do not take advantage of the metadata 
included in files designed with the ISTP Guidelines. 
For example, three visualization tools which make 
use of CDF are shown in Figure 1 : CDF X-Win- 
dows Image Tool (CXIT, available free from the 
NSSDC) for viewing image data; NSSDC Graphics 
System (NGS, available online at NSSDC) for 
plotting data from the Coordinated Data Analysis 
Workshop (CDAW), and the Advanced Visualiza- 
tion System (AVS) which is a commercial package 
for plotting multi-dimensional data. In addition, 

IDL has a built-in CDF interface, as well as access 
to the programming library. 

The ISTP Key Parameter Visualization Tool is 
a prototype software package (version 1.1) devel- 
oped at the Science Planning and Operations 
Facility (SPOF), for plotting ~1 minute resolution 
survey data from the ISTP program (referred to as 
Key Parameter (KP) data). This tool consists of two 
parts: one written in Fortran that is responsible for 
reading the data and metadata; and one written in 
IDL with widgets that provides the data processing 
and plotting capabilities. In addition, this tool has 
its own custom interface between IDL and the CDF 
software, and does not use either the CDF library 
provided by IDL version 3 .0 nor the IDL interface 
programs provided by CDF version 2.3. These 
versions were not available when the tool was first 
designed, and then it was decided to preserve the 
independence of the custom interface to allow for 
use with PV-Wave or other commercial plotting 
packages. The data files must be in CDF and 
compliant with the ISTP standard in order to be 
plotted by the visualization tool. The tool uses the 
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metadata as outlined in the ISTP guidelines: labels, 
units, scales, etc. The visualization tool is available 
to ISTP investigators through the Central Data 
Handling Facility (CDHF) on an as-is basis with no 
support. The visualization tool was developed on a 
DEC workstation running Ultrix and later ported to 
a Sun Workstation. On both platforms, IDL with an 
X-Windows environment, is required. 

At this time, and for some time in the future, 
the ISTP data sets are proprietary. It is envisioned 
that the non-proprietary CDAW-8 and CDAW -9 
data which were originally cast in CDF will be 
updated to embrace the ISTP guidelines and can 
then take full advantage of the visualization tool. 
Other investigators will be able to take advantage of 
these tools by putting data into CDF with the ISTP 
guidelines. We provide in this paper a discussion of 
how to create ISTP-like CDFs, and how to visualize 
the data. The plots are either time series data or 
hodogram type plots. We show a sample plot and 
discuss the planned extensions. 

2. CREATION OF ISTP-LIKE CDFs 

ISTP CDF data sets are organized as daily files, 
with separate files for each experiment on each 
spacecraft and for each ground-based station. The 
data are, for the most part, one minute averaged 
time series scalar and vector data, although some 
images at a resolution of approximately 5 minutes 
are also being provided. For the new spacecraft 
within ISTP, the orbit and attitude files are being 
provided separately (also in CDF), whereas the 
previously launched spacecraft (Imp-8, GOES, 
LANL) provide the orbit information along with 
their data variables. Each ISTP CDF data set 
includes both data and descriptions of the data 
(metadata). The descriptions include header infor- 
mation (e.g., number of variables, number of 
attributes, dimensions, data encoding), global 
information about the data set (and experiment) as a 
whole (global attributes) and information carried 
with each variable (variable attributes). For ISTP, 
there is a standard set of attributes to promote easier 
and more effective data sharing, referred to as the 
ISTP guidelines. Users expect and will receive the 
information required to both understand and ana- 
lyze each data set. In a short paper we cannot 
provide all of the details necessary to create a CDF 
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data set adhering to the ISTP guidelines. Instead, 
we give the flavor of the creation process through 
analogy. The Make-CDF interface, mentioned 
previously, eliminates the coding step discussed 
below. We also list the complete set of ISTP 
standard attributes associated with a scalar variable. 
Some of these attributes are not related to plotting, 
and some have not been incorporated into the 
present release of the visualization software. 

2.1 CDF Analogy 

The process of creating a CDF data set is 
similar to the process of building a house (see 
Figure 2). The number of floors in a house is 
similar to the dimensions of the CDF, and the 
rooms are analogous to variables. Furniture and 
possessions are analogous to data. For the ISTP 


CDF designer, the data itself is always crucial to the 
design of the CDF. The data and the ISTP Guide- 
lines together dictate the design. The global features 
of a house (such as fireplaces, decks, garage) are 
similar to the global attributes that define the data 
set as a whole. Particular room features (closets, 
outlets, windows, sinks, bathtubs) are analogous to 
the variable attributes; which attributes to include 
depends on the nature of the variable. Just as the 
blueprints are the plans for a house, the skeleton 
table is the plan for the particular CDF data set. 

Once the blueprint is finished, the building 
stage can begin. This is usually the most time 
consuming part of the process for both houses and 
CDFs. For CDFs, the building stage involves 
changing the ASCII skeleton table description into 
a binary version and writing the computer code to 
add the data to the description. The final stage 
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Figure 2. Creating a CDF data set in analogy to building a house. 
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involves moving furniture into the house or data 
into the CDF (running the computer code). Once 
the furniture has been moved in, the house is ready 
for occupancy; when the data is put into the CDF, 
the CDF data set is complete and can be viewed, 
ported to another location, manipulated, or visual- 
ized via plotting software. 

2.2 Standard Attributes 

Variable-scope-attributes are linked with each 
individual variable, and are an important means of 
providing additional information about each vari- 
able. A standard set of these attributes (required for 
every variable) is very important, for this is where 
certain required information can be stored in a 
commonly defined manner. Any application pro- 
gram can utilize the information, but the CDF data 
set designer does not need to know the details of the 
application program. The specific variable-scope- 
attributes in the ISTP standard list for scalar data 
are listed in Table 1 and defined below. 

FIELDNAM (required) holds a character string 
(up to 30 characters) which is longer and more 


Table 1. ISTP Standard Set of Variable-Scope-Attributes for 
a Scalar Variable. 


Scalar Variable Attributes for "Density" 


Attribute Name 

Type 

Example of Attribute Value 

Required Attributes 


FIELDNAM 

CDF.CHAR 

Proton No. Density 


LABLAXIS 

CDF_CHAR 

Np 


UNITS 

CDF_CHAR 

no/cc 

* 

VALIDMIN 

CDF_REAL4 

0.0 

* 

VALIDMAX 

CDF_REAL4 

50.0 

* 

SCALEMIN 

CDF_REAL4 

0.0 

* 

SCALEMAX 

CDF_REAL4 

50.0 

* 

FILLVAL 

CDF_REAL4 

-1.0E31 


FORMAT 

CDF_CHAR 

F6.2 

* 

DEPEND_0 

CDF_CHAR 

Epoch 


VARJTYPE 

CDF_CHAR 

data 


DICT_KEY 

CDF.CHAR 


Optional Attributes 


CATDESC 

CDF_CHAR 

Proton number density determined 
from a moment calculation 


SCALETYP 

CDF_CHAR 

linear 


AVG_TYPE 

CDF.CHAR 

standard 

• 

DELTA_PLUS_VAR 

CDF_CHAR 

uncertainty 

• 

DELTA_MINUS_VAR 

CDF_CHAR 

minus_uncertainty 

“ 

OFFSET_0 

CDF.CHAR 

time_offset 


• Indicates that these are pointer class attributes which have as 
theirvalue anothervariable in the CDF dataset. These attributes 
are used to link the parent variable, in this case, Density, to the 
“attachedvariables”: Epoch, uncertainty, minus uncertainty, 
and time_offset. The * indicates that these attributes are of the 
same data type as the variable. 


descriptive than the name of the variable. It can be 
used to label a plot either above or below the axis, 
or can be used as a table heading. Therefore, 
consideration should be given to the use of upper 
and lower case letters where the appearance of the 
output plot or table heading will be affected. 

LABLAXIS (required) should be a short character 
string (no more than 1 0 characters, but preferably 6 
characters) which can be used to label a y-axis for a 
plot or to provide a heading for a table. 

UNITS (required) is a character string (no more 
than 20 characters, but preferably 6 characters) 
representing the units of the variable, e.g., nT for 
magnetic field. If the standard abbreviation used is 
short then the units value can be added to a table 
heading or plot label. 

VALIDMIN and VALIDMAX (required) hold 

values which are, respectively, the minimum and 
maximum values for a particular variable that are 
expected over the lifetime of the mission. 

SCALEMIN and SCALEMAX (required) are 
values which can be based on the actual values of 
data found in the CDF data set or on the probable 
uses of the data, e.g., plotting multiple files at the 
same scale. The visualization software uses these 
attributes as defaults for plotting, but lets the user 
override the default scale. 

FILLVAL (required) is the number inserted in the 
CDF in place of data values that are known to be bad 
or missing. Fill data are always non-valid data. 

FORMAT (required) is the output format used 
when extracting data values out to a file or screen 
(using CDFlist). The magnitude and the number of 
significant figures needed should be carefully 
considered. A good check is to consider it with 
respect to the values of VALIDMIN and 
V ALIDM AX attributes . The output format can be 
in Fortran or C, but Fortran is preferred. 

DEPEND_0 (required for time- varying vari- 
ables) explicitly ties a “parent” variable to the time 
variable on which it depends; the value of the 
attribute is the time variable. For ISTP data sets, the 
data is at one time resolution and the time variable 
in each data set is called “Epoch” (stored as milli- 
seconds AD, but displayed as year, month, day, 
hour, minute, second). 

VAR_TYPE (required) identifies a variable 
as either data (integer or real numbers) or as 
metadata (labels or character variables) . 
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DICTJKEY (required) comes from a data 
dictionary and describes the variable to which it is 
attached. (At this time ISTP does not have a 
completed data dictionary; this attribute is included 
as a placeholder only and at this time does not 
affect visualization.) 

CATDESC (optional) (catalog description) is 
an 80-character-max string which is a textual 
description of the variable. This attribute should 
be used whenever there is possible confusion in 
the meaning of the variable or when 
FIELDNAM is not long enough to store the 
information. 

SCALETYP (optional) indicates whether the 
variable should have a linear or a log scale as a 
default. If this attribute is not present, linear scale 

is assumed. 

AVG_TYPE (optional) sets up useful default 
conditions: different techniques appropriate to 
averaging different types of data. If this attribute is not 
present, standard average, i.e., simple arithmetic 
mean, is assumed. For other options, see [ 1 ] . 

DELTA JPLUSJVAR and DELTA_MINUS_VAR 
(optional) are included to point to a variable (or 
variables) which stores the uncertainty in (or range of) 
the original variable’ s value. The uncertainty (or range) 
is stored as a (+/-) on the value of the original variable. 
For many variables in ISTP, the original variable will 
be at the center of the interval, so that only one value 
(or one set of values) of uncertainty (or range) will 
need to be defined. In this case, DELTA_PLUS_VAR, 
and DELTA_MINUS_VAR will point to the same 
variable. 

QFFSET_0 (optional) is used as a way to carry 
multiple time resolutions or multiple time tags 
offset from each other in a file, while maintaining 
only one time that is the record ordering parameter. 
The variable which holds the time offset(s) is the 
value of the attribute. 

3. VISUALIZATION OF ISTP-LIKE CDFs 

The visualization software is best described 
through a demonstration of its user interface. 

Figure 3 shows the IDL widget layout for the 
software. There is a main panel which consists of 
four sections labelled A through D in Figure 3. 
Section A is the structure setup which is a series of 
buttons with a number of options; these may be 
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selected in any order. Users choose the number of 
panels to plot and the type of plot: either a time 
series or a variable vs. variable plot. Users choose 
to plot either ISTP CDFs or Other CDFs. Users also 
choose one of three options: to plot variables from a 
single CDF (single CDF per panel per plot); to 
allow variables from multiple CDFs to be plotted, 
but only from a single CDF on any one panel 
(single CDFs per panel and multi CDFs per plot); or 
to plot from more than one CDF for any panel 
(multi CDFs per panel per plot) . 

Section B shown in Figure 3, is a series of 
selections each of which require another screen. 

The selections must be made in order from top 
down as shown in Figure 3, and listed below. A 
warning message will appear if one of the 
selections is skipped. 

a Select the number of key parameters per 
panel, up to two is allowed. 
b Select the spacecraft or ground-based 
investigation (ISTP mission). 
c Select the ISTP experiment. 
d Select the ISTP CDFs. 
e Select the ISTP key parameters (for time 
series plot). 

For each of these selections, more choices are 
presented on subsequent screens; users are 
presented with choices so that they are not required 
to type in values which may not be known apriori. 
Two of these screens (c and d) are context sensitive, 
i.e., if IMP-8 is chosen as the spacecraft in b, then 
only two experiments appear in c: the 
magnetometer and the plasma experiment, because 
these are the only two instruments on IMP- 8 
presently supplying data to ISTP. For d, only IMP-8 
magnetometer CDFs appear because the 
magnetometer was chosen. The user is presented 
with a hierarchy of choices, where each choice 
limits the following choice to only those CDFs of 
interest. For this example, a single CDF per panel 
per plot was chosen in Figure 3. If a multi CDF 
option was chosen, then more than one choice is 
allowed on following panels. The final selection, e, 
is of the key parameters to be plotted on each panel. 
The user is presented with only the choices that 
follow from those made before. In this example, 

3 panels were chosen with a single CDF per panel 
per plot so it possible to choose only from this one 
CDF. The key parameters available are all listed so 
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Figure 3. Visualization Software. 
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that users do not have to know what variables are in 
the file, or the names of the variables. It is possible 
to customize the plot from the graphics setting 
menu, as shown in Figure 3 f. The user selects the 
plot style, the axis scaling (linear or log), and the 
number of tick marks. It is not necessary to look at / 
unless the data passes a day boundary; defaults are 
chosen if no other selection is made. 

Section C, shown in Figure 3, supplies 
information such as the Standard Formatted Data 
Unit { SFDU) label or the skeleton table associated 
with a particular CDF. The SFDU label contains the 
global metadata from a CDF, and the skeleton table 
(discussed in section 2.1) lists all variables and their 
associated attributes as well as the global metadata. 
Users can obtain statistics on a particular CDF data 
set with a call to a CDF toolkit routine. It is also 
possible to print or erase the screen containing the 
graphics. Section D in Figure 3, controls the actions 
such as to plot, to rescale, etc. Figure 3 g shows 
options for rescaling the plots. 

Underneath the user view is code that takes 
advantage of the global and variable attributes 
which are standard with the ISTP Guidelines. Users 
are isolated from the format of the data and so can 
plot data without any knowledge of CDF or the 
ISTP Guidelines. Defaults are chosen from the 
attributes associated with the variables in the CDF: 
LABLAXIS, UNITS, SCALEMIN, and 
SCALEMAX. If LABLAXIS and UNITS are not 
included in the CDF, “no such entry” will appear in 
their place. For this example, the resulting plot 
using IMP-8 magnetometer data is shown in 
Figure 4. This data is from 7 July 1993 and was 
produced with version 1 .6 of the Key Parameter 
Generation Software (KPGS). The magnetic field 
measurements are shown in GSM polar coordinates. 
The labels and units, as well as the scales, came 
from the variable metadata provided in the CDF. 

The magnetic field data was provided as a vector (a 
single variable with 3 components) but the 
components and their associated metadata were 
split apart and plotted separately. Later releases of 
the software will include use of other attributes: 
FILLVAL. DEPENDJ), SCALETYP, 

AVG_TYPE, OFFSETJ3, DELTA_PLUS_VAR, 
and DELTA_MINUS_VAR. 
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Figure 4. IMPS magnetic field key parameter data from 7 July 
1993 . 


4. SUMMARY AND CONCLUSIONS 

The International Solar-Terrestrial Physics 
Program will combine existing and future space- 
craft and ground-based measurements from four 
agencies: NASA, ESA, ISAS, and IKI spread over 
many countries. This approach to global science 
places special emphasis on data standards in order 
to provide easy and effective data sharing among 
the Space Physics community. The Common Data 
Format (CDF) with the ISTP Guidelines provides 
transportable data sets and software and isolates the 
end user from low level programming. The ISTP 
Key Parameter Visualization Tool allows the user 
to plot data conforming to the ISTP CDF standard 
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without needing outside information or detailed 
descriptions of the data. 
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A software environment has been developed to assist in 
analysis and visualization of numerical outputs from 
NCAR ’s thermospheric general circulation models. The 
codes producevector graphics, raster imagery, and time- 
dependent animation of model results, with provisionfor 
comparison with empirical models and relevant data from 
satellite and ground based instruments. Separate pro- 
grams are executed on the NCAR CRAY YMP8/864 and 
HAO divisional Sun workstations, providing both large 
volume high speed batch and custom interactive comput- 
ing environments. The visualization system assembles a 
suite of existing software packages including NCAR Graph- 
ics, IDL, netCDF, and the Xt intrinsics library with the 
Athena widget set. 
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1. MODEL INTRODUCTION 

The principal object for which the visualization 
system has been developed is the thermospheric 
general circulation model (TGCM) and its several 
descendants. The original National Center for 
Atmospheric Research (NCAR) TGCM was intro- 
duced by Dickinson et al. ( 1 98 1 ) as a three-dimen- 
sional, time-dependent numerical general circula- 
tion model of the thermosphere. Over the past 
decade, the model has been extended to include the 
effects of upward propagating tides from the middle 
atmosphere, auroral particle precipitation, and self- 
consistent mutual coupling between the thermo- 
spheric neutral gas and ionospheric plasma (ther- 
mosphere-ionosphere general circulation model, or 
TIGCM). In 1991 , an interactive dynamo model 
was introduced to calculate self-consistent electro- 
dynamic interactions between the thermosphere and 
ionosphere [Richmond etal., 1992], allowing 
calculation of global electric potential distribution 
and ion drift velocities (thermosphere-ionosphere- 
electrodynamics general circulation model, orTIE- 
GCM) . Most recently, a version of the model has 
been extended downward into the mesosphere and 
upper stratosphere (thermosphere-ionosphere- 
mesosphere-electrodynamics general circulation 
model, orTIME-GCM) [Roble and Ridley, 1993], 

These models use an effective 5 degree global 
latitude/longitude grid with 25 or 45 constant 
pressure surfaces in the vertical, for the TIE-GCM 
and TIME-GCM, respectively. The model time step 
is typically 5 minutes, with model histories being 
saved typically every hour for a full day simulation. 
The model is run on the NCAR CRAY Y-MP8/864 
(8 processors, 64 MW central memory with SSD), 
using about 23 minutes of CPU time for a one-day 
TIE-GCM simulation. Multiple histories are stored 
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in history volume files on the NCAR Mass Store 
System (MSS) (the principal storage medium is 
IBM 3480 tape cartridges of 200 Mb each). Full 
sized history volume files are about 170 Mb. 

Fifteen fields are saved on TIE-GCM histories, and 
30 fields on TIME-GCM histories. 

2. POST-MODEL PROCESSING 

Post- model processors developed for 
visualization obtain CRAY binary model history 
files from the NCAR MSS. Codes executing on the 
CRAY read these histories directly and call NCAR 
Graphics to produce contours or surface plots of 
two-dimensional slices selected from the three- 
dimensional model grid. Because the large raw 
history files are awkward to download and read 
with IEEE Sun machines, the CRAY codes are also 
capable of writing selected portions of the histories 
as netCDF files, which may subsequently be used 
by the Sun processors. The Sun codes operate in 
an X-Window environment and use IDL to produce 
two-dimensional raster images (Figures 5, 6, and 7). 
NCAR Graphics is used to produce monochrome or 
color-fill contour maps (Figures 1-4 and 8-10). The 
same netCDF history files may be read by both the 
IDL and NCAR Graphics processors, and may also 
be used with other software being developed by the 
visualization community at large. 

Capabilities of the post-model processors may 
be described in five functional categories: 

1 . snapshot; 

2. time-dependent; 

3. animation; 

4 . difference fields; and 

5. model-data comparison. 

Snapshot codes display model results at a single 
universal time (UT) (one history). This includes 
contouring fields over 1 1 possible mapped projec- 
tions of the globe (e.g., cylindrical equidistant in 
Figures 1,3, and 5; satellite view in Figures 2, 4, 
and 8; and polar sterographic in Figure 7), zonal 
and meridional vertical cross-sections (e.g., Figure 
6), and vertical profiling of global means. Linear or 
log interpolation of fields from the pressure coordi- 
nate system of the model to constant height surfaces 
is also a capability of the snapshot processors. 

Time-dependent codes display model results 
from multiple UTs, typically one-to-ten days with 
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hourly resolution. Fields are contoured with UT and 
solar local time on the X-axis, and latitude, pres- 
sure, or height on the Y-axis (e.g., Figure 10). 

These plots help visualize the model response to 
time varying quantities such as those found during 
geomagnetic storm events. Time-dependent vertical 
profiles at selected grid point locations (“station 
processors”) are often used for comparison of 
model results with ground-based instruments. 

Time-dependent analysis of the model is 
assisted by animation of time-interpolated results. 
For example, a model field may be interpolated 
between hourly histories, and contoured over a 
satellite view projection with each frame interval 
representing 20 minutes of model time and 5 
degrees rotation of the earth. If this frame series is 
recorded to video tape at a rate of 10 frames per 
second, a ten-day model simulation may be viewed 
in a 72 second video animation. Color raster image 
and monochrome or color- fill contour animations 
have proved useful in analyzing time-dependent 
phenomena in the models. The NCAR Text and 
Graphics System (TAGS) is capable of recording 
image sequences in a variety of video formats 
including VHS, SVHS, Betacam-SP, and Umatic- 
SP. Before recording finished sequences to video 
tape, animations may be previewed on local work- 
stations using the Ximage software developed at the 
National Center for Supercomputing Applications 
(NCSA). 

Difference field codes obtain results from two 
separate model runs and display raw or percentage 
differences of selected fields. This is useful for 
isolating the effects of altering model input quanti- 
ties, or evaluating model predictions under different 
geophysical conditions. 

Several independent codes have been developed 
for comparison of the TGCM models with empiri- 
cal models and instrument data. Neutral atmo- 
sphere simulations are compared with the mass 
spectrometer and incoherent scatter model (MSIS) 
[Hedin, 1991], and ionospheric parameters are 
compared with the International Reference Iono- 
sphere (IRI) [Belitiza, 1986, 1990]. Several studies 
have been made with ground based incoherent 
scatter radars, as well as with satellite programs 
including the Air Force SETA satellites, the Dy- 
namic Explorer Satellites (DE-I and DE-II), and the 
Upper Atmosphere Research Satellite (UARS). 
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3. CONCURRENT-MODEL PROCESSING 

Often model development and analysis requires 
knowledge of internal parameters calculated during 
a model run, but not preserved in history files 
following model completion. To allow display and 
evaluation of these parameters the model itself is 
introduced into the visualization software and 
allowed to start up from a previously written history 
and execute a single time step. These programs 
execute the model code as a subprogram and are 
termed concurrent model processors. After the 
model has completed a time step, the fields are 
displayed in the same manner as the post-model 
processors. An advantage of this approach is that 
code transfer from the model to the processors is 
unnecessary, eliminating the potential for diverging 
versions of model and processor to produce con- 
flicting results. Concurrent model processors assist 
in diagnostic analysis of parameters that exist only 
during model execution such as heating and cooling 
rates, conductivities, ion drag forcing, and several 
terms from the momentum and thermodynamic 
equations used in the models. 

Acknowledgments. The National Center for Atmo- 
spheric Research is sponsored by the National 
Science Foundation. IDL (Interactive Data Lan- 
guage) is a product of Research Systems, Inc. 
(Boulder, CO). NetCDF (Network Common Data 
Form) is a product of Unidata (Boulder, CO). 


REFERENCES 

Bilitza.D., International reference ionosphere: Recent 
developments, Radio Sci.,21, 343-346, 1 986. 

Bilitza, D., International reference ionosphere 1 990, 
National Space Science Data Center/World Data 
Center A for Rockets and Satellites, NSSDC/WDC-A- 
R&S 90-22, Nov. 1990. 

Dickinson, R.E., E.C. Ridley, and R.G. Roble, A three- 
dimensional general circulation model of the thermo- 
sphere,/. Geophys. Res., 86, 1499-1512, 1981. 

Hedin, A.E., Extension of the MSIS thermosphere model 
into the middle and lower atmosphere, J. Geophys. 
Res. ,96, 1159-1172,1991. 

Hedin, A.E. et ah. Revised global model of thermo- 
sphere winds using satellite and ground-based 
observations,/. Geophys. Res., 96, 7657-7688, 1991. 

Richmond, A.D., E.C. Ridley, and R.G. Roble, A 
thermosphere/ionosphere general circulation model 
with coupled electrodynamics, Geophys. Res. Lett . , 
79,601-604,1992. 

Roble, R.G., E.C. Ridley, A.D. Richmond, and R.E. 
Dickinson, A coupled thermosphere/ionosphere 
general circulation model, Geophys. Res. Lett., 15, 
1325-1328,1988. 

Roble, R.G., and E.C. Ridley, A thermosphere-iono- 
sphere-mesosphere-electrodynamics general circula- 
tion model (TIME-GCM): Equinox solar minimum 
circulation (30-500 km), Geophys. Res. Lett.. 19, 
submitted, 1993. 



98 


Visualization Techniques in Space and Atmospheric Sciences 



TOTAL DENSITY (0+02+N2) 
DAY=79088 UT= 9.00 HT=300.00 


6 12 18 
LOCAL TIME 


2.00E- 14 3.50E- 14 5.00E-14 


Figure I. Colorfill contours of totaldensity(0+02+N2) at 300 km altitude mapped onto a cylindrical equidistant global projection. The graph 
at bottom shows the geophysical index Kp during a ten day geomagnetic storm period in March, 1 979. The time of the density image is at 
the end of the red portion of the Kp graph. This figure is a still frame from a video animation of a ten day model simulation. 



Figure 2. Color fill contours of atomic oxygen at approximately 120 km, mapped onto a satellite view projection. Kp graph as 
in Figure 1. This plot shows depletion of atomic oxygen at high northern latitudes during the March, 1979, geomagnetic storm 
period. This is also a still frame from a video animation. 
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Figure 3. Color fill contours of neutral temperature on a cylindrical equidistant projection, with overlying vectors of neutral wind 
velocities. The effect of semi-diurnal upward propagating solar tides are visible in the temperature contours at this altitude 
(approximately 120 km). 



Figure 4. Neutral temperature and wind response to the July, 1991, total solar eclipse at an altitude of approximately 300 km. 
These are difference fields between two TIE-GCM runs: one with the eclipse subtracted by one without the eclipse. The solid line 
shows the path of totality. Vectors indicate winds converging into the area cooled by the eclipse's shadow. 
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Figure 5. Main control window of an interactive snapshot Sun processor running IDL with OpenWindows. The draw widget shows 
a raster image of electron density at the -4 pressure surface ( about 120 km) on a cylindrical equidistant projection. Density highs 
are seen at solar local noon (near center of the image), and at high latitudes (the auroras). 
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Figure 6. Raster image with overlaid contours showing a vertical slice of electron density along the 57.5 degree north latitude circle. This 
image was produced by the snapshot Sun processor running IDL (as in Figure 5). Local noon is at zero degrees longitude. The vertical 
axis is in a log pressure scale. Pressure level -7 corresponds to 95 km, and pressure level +5 ( top of the model) is typically 400 km during 
solar minimum conditions and 600 km during solar maximum conditions. 



Figure 7. Favorable model-data comparison of the typical dual vortex wind pattern at high latitudes near 300 km altitude. This is a 
stereographic projection with a raster image of neutral temperature. Predicted neutral wind velocities ( white arrows) are from the TIE-GCM, 
and observed wind velocities ( black arrows) were measured during an orbit pass of the Dynamic Explorer-11 satellite. 
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Figure 8. Color fill contours of electron density ( pressure level -4, approximately 120 km) on a satellite view projection showing the effect 
of the aurora in the northern hemisphere. This plot was produced by the snapshot post-model processor calling NCAR Graphics. 



Figure 9. Stack of five color contour maps showing nitric oxide densities at different vertical pressure levels (Zp -6 to -2 corresponding 
to approximately 100 to 150 km altitude ). Each plane is a global latitude versus, longitude map. This plot shows the vertical variability 
of high latitude nitric oxide density. Numbers to the right under the Zp values show the range of nitric oxide (logjo) at each level. 
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Figure 10. Atomic oxygen density difference fields (at approximately 300 km) over an 8 day period along the 70 degree west 
longitude ( latitude on the Y-axis). A strong geomagnetic storm event commenced during the third day. Decreased atomic oxygen 
densities are predicted at high latitudes throughout the storm (see also Figure 2). 
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The National Collaborator y concept has great potential 
for enabling "critical mass ” working groups and highly 
interdisciplinary research projects. We report here on a 
new program to build a prototype collaboratory using the 
Sondrestrom Upper Atmospheric Research Facility in 
Kangerlussuaq, Greenland and a group of associated 
scientists. The Upper Atmospheric Research Collaboratory 
(UARC)isa joint venture of researchers in upper atmo- 
spheric and space science, computer science, andbehav- 
ioral science to develop a testbed for collaborative remote 
research. We define the "collaboratory” as an advanced 
information technology environment which enables teams 
to work together over distance and time on a wide variety 
of intellectual tasks. It provides: (1) human-to-human 
communications using shared computer tools and work 
spaces; (2) group access and use of a network of informa- 
tion, data, and knowledge sources; and (3) remote access 
and control of instruments for data acquisition. The 
UARC testbed is being implemented to support a distrib- 
uted community of space scientists so that they have 
network access to the remote instrument facility in 
Kangerlussuaq and are able to interact among geo- 
graphically distributed locations. The goal is to enable 
them to use the UARC rather than physical travel to 
Greenland to conduct team research campaigns. Even on 
short notice through the collaboratory from their home 
institutions, participants will be able to meet together to 
operate a battery of remote interactive observations and 
to acquire, process, and interpret the data. 
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1. INTRODUCTION 

We describe here a multidisciplinary effort 
linking research in computer science, behavioral 
science, and upper atmospheric and space physics. 
The purpose of this effort is to conceive, develop, 
deploy, test, evaluate, and integrate a high perfor- 
mance group centered computer environment into 
collaborative experimental activities ongoing in the 
space science research community. Such group 
computing environments, we predict, will become 
an important component of the National Informa- 
tion Infrastructure (Nil) initiative, which is envi- 
sioned as the high performance communications 
infrastructure to support future national scientific 
research. 

Because upper atmospheric and space science is 
influenced by a broad and diverse set of regions and 
processes, progress requires an ever increasing 
synthesis of information from a wide variety of 
experimental data and the interaction between 
scientists in varying areas of research. Thus, the 
computing infrastructure to support this research is 
becoming increasingly necessary to support col- 
laborative efforts to acquire and synthesize infor- 
mation. The technology to enable such interactions 
is now developing rapidly and the United States is 
making a major national commitment to develop 
and deploy such technology. 

By way of background, the term telescience 
was coined in 1985 to capture the notion of an 
investigator participating in experimental opera- 
tions at a distance supported by an electronic 
infrastructure. Such a notion was developed to 
characterize some types of experimental operations 
envisioned for the space station. A variety of 
scenarios were envisioned in which the principal 
investigator on an experiment was not able to be 
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physically present on the space station, but whose 
critical knowledge and insight were necessary to the 
success of the experiment. Thus, a virtual presence 
for the investigator, working with the space station 
crew, is the logical requirement for the success of 
these experiments. 

The Sondrestrom Upper Atmospheric Research 
Facility in Kangerlussuaq, Greenland is a remote 
ground facility, jointly supported by the National 
Science Foundation and the Danish Meteorological 
Institute and maintained and operated by SRI 
International [Kelly, 1983; Wickwar et al., 1984; 
Claueretal., 1984], This facility was proposed to 
NASA, in 1989 to be a good analogue to develop, 
test, and e valuate the tools necessary for 
telescience. The Sondrestrom facility could be 
utilized to test a variety of remote experimental 
activities which may be similar to those envisioned 
for a space station. For example, the facility 
contains complex equipment which is maintained 
and operated by a site crew, is located in a remote 
site with limited logistical support and limited 
travel access, and often requires that the investiga- 
tors travel to the site to undertake experimental 
operations. Remote access to support experimental 
operations could have many benefits to the science 
undertaken at the Sondrestrom facility, as well as 
provide an effective testbed in which to develop the 
tools to enable such interactions. 

A telescience testbed program centered upon 
the Sondrestrom incoherent scatter radar was 
supported by NASA for two years. With this 
support, an electronic satellite link to the facility 
was established and the development of X-Win- 
dows based software to provide remote display of 
real time data from the radar was developed. The 
Sondrestrom facility, however, contains a variety of 
separate, but often synergistic, experiments oper- 
ated by a variety of scientists who are distributed 
around the world. Thus, the telescience infrastruc- 
ture developed through the NASA supported 
telescience testbed has provided the enabling 
capability for a much more ambitious activity: the 
Upper Atmospheric Research Collaboratory 
(UARC) testbed. 

The notion of telescience was greatly expanded 
in 1989 with the concept of the National 
Collaboratory [National Collaboratories, National 
Academy Press, Washington, DC, 1993]. The 
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National Collaboratories report, prepared under the 
auspices of the National Research Council, con- 
cluded that “collaboratory testbed programs have 
the potential to address important scientific needs 
while simultaneously representing a key step 
toward developing national and global infrastruc- 
tures.” The collaboratory is described as a “center 
without walls in which the nation’ s researchers can 
perform research without regard to geographical 
location — interacting with colleagues, accessing 
instrumentation, sharing data and computational 
resources, and accessing information from digital 
libraries.” Thus, the collaboratory infrastructure is 
envisioned to support distributed interaction be- 
tween people, access to remote information sources 
and digital libraries, and access to and interaction 
with remote and unique facilities. 

It is the aspects of access and interaction with 
remote and unique facilities, as well as distributed 
interactions between people that are the focus of the 
prototype testbed which we have named the Upper 
Atmospheric Research Collaboratory (UARC). 
Beginning in September, 1 992, the Computer 
Information Science and Engineering Directorate 
(CISE) in cooperation with the Atmospheric 
Sciences Directorate of the National Science 
Foundation has funded this major collaboratory 
testbed within the space science community. It was 
felt that progress would be accomplished most 
rapidly by using a testbed approach formed around 
a collaborating group in a real world environment. 
Thus, the UARC has been formed around the 
ongoing research activities among a group of space 
scientists engaged in experimental operations at the 
Sondrestrom Upper Atmospheric Research Facility 
in Greenland. The UARC is being created using a 
user-oriented, rapid prototyping research approach 
at the University of Michigan, SRI International, 
and the other testbed sites — the Danish Meteoro- 
logical Institute, the University of Maryland, and 
the Lockheed Palo Alto Research Laboratory . In 
this project, an interdisciplinary group of investiga- 
tors is supported to conduct coordinated experimen- 
tal research related to creating and evaluating 
distributed environments to support team science. 
The goal is to build a networked environment to 
support interactive observational campaigns and 
team science using multiple instruments and a 
distributed group of investigators located at their 
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home institutions rather than at the Sondrestrom 
facility. 

2. THE SONDRESTROM 

COLLABORATORY TESTBED 

The instrumentation presently supported to 
participate in this prototype environment, their 
principal investigators, and their institutions 
include: (1) incoherent scatter radar, John Kelly, 
SRI International; (2) imaging riometer, Peter 
Stauning, Danish Meteorological Institute and Ted 
Rosenberg, University of Maryland; (3) 
magnetometers, Eigil Friis-Christensen, Danish 
Meteorological Institute; (4) Fabry Perot 
interferometer and optical spectrometers and 
photometers, RickNiciejewski and Tim Killeen, 
University of Michigan; and (5) all-sky imaging 
television camera, Steve Mende, Lockheed Palo 
Alto Research Laboratory. 

We plan for the testbed to evolve through three 
design phases: The “wire service” phase, the 
“point and talk” phase, and the “fully shared 
control” phase. We are presently in the wire 
service phase, where, using existing technology, the 
instruments at Sondrestrom generate a stream of 
data that is collected, stored, forwarded, and 
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displayed at user sites. The point and talk phase is 
an extension to the wire service phase that allows 
researchers to interact jointly while they are located 
at several distributed sites. We have entered this 
phase with the implementation of a set of simple 
communications windows that broadcasts discus- 
sion to all interactive users. The fully shared 
control phase builds upon the point and talk phase 
to provide more natural and fluid group interactions 
using emerging voice and video communication 
technologies. The fully shared control phase also 
supports the remote control of the instruments. 

Figure 1 provides a schematic view of the 
testbed showing both initial configuration at 
Sondrestrom (a), and the user’ s laboratories (b), and 
also showing the technology which we hope to 
achieve and, in part, operationalize during the 
testbed (c). In the first level of communications 
represented by (b), a user may log onto a computer 
in Sondrestrom or a local server located elsewhere 
which is itself logged onto a computer in 
Sondrestrom via the Internet. Data may be trans- 
ferred via standard file transfer techniques for 
display. This is being automated to become the 
wire service phase now. The project will evolve to 
the level of communication illustrated in (c). Here 
we show multiple users showing multiple data in 
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windows which support interactive poi nting, 
annotation, sketching, text, voice, etc. Control 
panels will also permit investigators to interact with 
their instruments while also providing safeguards to 
prevent conflicting or inappropriate actions. Parts 
of (c) have been implemented, including the ability 
for multiple users to view data in multiple windows 
and communicate through a shared text window. 

Because the project involves a relatively small 
number of institutions and users, we have chosen 
the luxury of utilizing a homogeneous computing 
environment and to not deal with interoperability 
issues. This permits us to focus on the issues 
directly related to design specifications for a 
collaborative environment. We have chosen to use 
the NeXT computing environment including the 
NeXTStep Interface Builder. NeXTStep was 
chosen because it tightly integrates both the operat- 
ing system, supporting software, and development 
environment under a unifying object-oriented 
software library. NeXTStep also allows instances 
of software objects running on one workstation to 
be seamlessly distributed to other workstations on 
the Internet. This supports the modular develop- 
ment of distributed group-oriented software in an 
evolutionary fashion. It represents, in our view, the 
best existing example of the type of computer 
system which will be common in the future. Also 
very important to our approach is that the use of 
this homogeneous, object-oriented environment has 
enabled us to pursue a rapid prototyping approach 
to system design and testing. We envision a cycle 
time of about 3 months for software design, devel- 
opment, deployment, evaluation, and revision. 
Development can proceed much faster using the 
NeXT development system than systems, such as 
X-Windows, for example. It is our opinion that 
greater efficiency will be achieved by doing the 
development in the NeXT environment and then 
porting the developed software to X if this becomes 
desirable at a later time. Also, since distributed 
objects are now supported separately in the UNIX 
environment, it will be possible to continue to 
utilize the NeXT server model with distributed 
objects passed to X-Window clients for display. 
Such a strategy will permit the further extension to 
a wider community in an efficient way since only 
the display software will have to be rewritten for 
the X-Window clients. 
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In Figure 2 we show a screen display from the 
current software captured from a past experimental 
operation. The various windows on the display 
show the data acquired by the radar — line of sight 
velocity (bottom right) and electron density (top 
right) from an azimuth scan. Also shown are the 
communications windows — one for entering the 
user’s messages (top left), and the other one which 
displays the messages sent by all users (bottom 
left). While there are threads of different conversa- 
tions which develop in the message window, this 
has not been a problem and users have been able to 
ignore or participate in conversations according to 
their desired involvement. Another data display 
window showing ionospheric plasma density as a 
function of time and altitude is partially visible 
lying underneath the two azimuth scan data display 
windows. The control menu window for the 
program is shown in the upper left corner and icons 
for other NeXT tools and applications are shown 
along the right side of the display. 

In Figure 3 we show the extent of the present 
UARC network. In all, 14 workstations are sup- 
ported and are distributed between developers and 
social scientists at the University of Michigan and 
the space science users who themselves are distrib- 
uted between Denmark, Greenland, Maryland, 
Michigan, and California. The various instruments 
in Sondrestrom are controlled by PCs and are 
connected to the Sondrestrom local area network 
(LAN). A VAX computer and a router provide 
connection to the external TCP/IP internet. 

As prototypes of various capabilities emerge, 
they will be tested and evaluated. The tools will 
develop through a series of iterations whereby 
feedback from the evaluations define new require- 
ments for the tools. As prototype tools are refined 
and their utility demonstrated, they will become 
operational and will be maintained on the founda- 
tion layer of the testbed. 

A fundamental characteristic of our approach is 
a rapid prototyping cycle involving the definition of 
user requirements, prototyping, deployment, testing 
and evaluation, and revision. It is our goal to 
operate this cycle typically on a 3 or 4 month cycle 
time. Using the NeXT and an object-oriented 
approach to design and implementation, we have 
been able to “turn over” this cycle twice in the past 
six months. Specifically, the initial version of the 
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UARC Project software provides radar data distri- 
bution (from Greenland to anywhere in the Internet) 
and 2 types of data displays. That version was 
developed in 5 weeks. The performance of this first 
version was evaluated during a radar campaign in 
April 1993. Utilization of the software was surpris- 
ingly robust. It supported, at times, over 1 2 sites 
including ones in Denmark, Greenland, and in the 
US, all observing the operations of the radar and 
interacting over choices of the radar operation 
modes. Essentially, the same version of the soft- 
ware has been demonstrated publicly at the Ameri- 
can Geophysical Union Spring Meeting in May 
[Claueretal., 1993], User reactions to that first 
version were then incorporated into a major revi- 
sion of the software, which was used to support a 
series of radar experiments in June, 1993. 

An important and unique aspect of this multi- 
disciplinary effort is the behavioral science re- 
search. Behavioral scientists are involved in this 
project in two ways. First, they are strongly in- 
volved in the software design. The behavioral 
scientists have taken the lead to define the objects 
which the space science users utilize and assist the 
computer scientists to codify these objects in our 
object oriented programming approach. The 
behavioral scientists are directing the object ori- 
ented design methodologies and are monitoring the 
use of the software to assist in the iterative redesign 
of the system. It is important for this project to 
develop generic design specifications which may 
have wide spread application to electronic collabo- 
ration across disciplines. The behavioral science 
team has a strong history in the development and 
evaluation of object oriented software to support 
team engineering efforts. Second, the behavioral 
scientists are providing documentation regarding 
the effect of introducing this new technology into 
the scientific practice of the space scientists who 
are using the testbed. Concurrent with the startup 
phase, the behavioral scientists have collected 
information about current work practices among the 
space scientists at the various sites. These measure- 
ments will provide a basis upon which to assess the 
effects of the new tools as they are implemented, 
tested, and evaluated in the testbed. An important 
outcome of the project will be a quantitative docu- 
mentation of the benefit or the changes which result 
from the utilization of the collaboratory. 


There are a number of interesting technical 
issues associated with providing the kinds of 
capabilities needed by the Sondrestrom users. To 
restate what we said earlier, the three classes of 
capabilities we are providing are: sharing of data 
from the Greenland instruments in real time over a 
wide area network; control of the instruments over 
the network; and collaboration tools that allow the 
scientists to work together over the network. We 
are developing a framework within which we can 
explore these technical issues and evolve tools that 
will work under the operating conditions found in 
the space science research (e.g., complex data 
displays and widely separated workstations). Our 
approach is a mixture of an object-oriented repre- 
sentation of the basic constructs (e.g., data, instru- 
ments, analysis procedures, data visualization tools) 
and a data flow model for handling the coupling 
between instruments and user interactions. 

The need for an explicit model of the user 
interaction becomes clear when we consider, for 
example, one of the key collaboration capabilities 
we plan to offer — the ability to share annotations on 
the data from the instruments. How should the 
annotations be represented? Since each user can 
choose to display data from a particular instrument 
in different ways, it seems like the best representa- 
tion is to associate the annotations with the data 
itself rather than with the visual display. This way 
each user can examine the annotations in the 
context of the particular way they have chosen to 
display the data. Of course, in some cases the 
annotations may not make sense unless the display 
is of a particular type. But, then someone reading 
the annotation could easily switch to that type of 
display, and the annotation would still be associated 
with the relevant data item. 

We are exploring how the annotations them- 
selves should be represented and shared. We 
envision the annotations being of multiple data 
types themselves, such as text, drawings, or voice. 
We will represent the annotations in such a way that 
any degree of sharing is possible, such as read-only 
or read-write access to others’ annotations. These 
capabilities must be offered so that they work over 
the wide area network with the typical delays that 
make precise synchronization impossible. 

Other capabilities will be added based upon the 
emerging needs of the user community, as well as 
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the enabling technologies produced by the computer 
science research team. The goal of the basic computer 
science and engineering (CSE) research in the project 
is to explore new paradigms (with appropriate meta- 
phors ) for tools supporting group interaction over real- 
time data. The CS E group is exploring the structure of 
toolkits, window systems, and underlying support for 
sharing, migration, and selective replay of windows 
andcollaborative sessions. 

3. DISCUSSION 

The collaborative technology developed 
through the research undertaken within the UARC 
project should be widely extendable through the 
space science community, as well as to other areas 
of science and, perhaps, beyond. Since much of the 
collaboration technology is expected to also have 
generic value, the development here may also 
provide future benefit to the design and use of 
similar coilaboratories in other areas, such as 
education, engineering, and business. 

As the project progresses, we intend to make it 
more comprehensive by adding additional instru- 
ments and users. As we consider the possibilities 
for extension of the UARC, issues of 
interoperability become more important. Since the 
NeXTStep operating system is now available for 
Intel 486 processors, it is relatively easy and 
inexpensive to join the homogenous UARC devel- 
opment environment. We note also that Hewlett 
Packard and SUN have announced they will support 
NeXTStep on their high performance RISC work- 
stations. Still, we realize this is not a suitable 
solution for general expansion. 

Since distributed objects are now supported 
independently in the UNIX environment, it is 
possible to utilize the UARC NeXT server in 
conjunction with an X-Windows display environ- 
ment at individual user sites. The only develop- 
ment which, is required for this to happen is the data 
display software for the X-Windows environment. 
The UARC distributed objects could be passed to 
the X-Windows system, or any other system 
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supporting distributed objects, and the display 
software for the objects can be developed sepa- 
rately. This allows us to maintain the foundation of 
the collaboratory and the rapid prototyping devel- 
opment environment with a clean separation from 
the user display environment. It is this approach 
that we feel will be the most effective and efficient 
for more general future extension of the UARC to 
additional users. 
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In anticipation of large data sets associated with a 
number of atmospheric imaging instruments being 
preparedfor long term global coverage, NRL is devel- 
oping graphical interfaces for all aspects of the pro- 
gram. For the first of these projects, RAIDS (the 
Remote Atmospheric and Ionospheric Detection Sys- 
tem), a graphical approach to data handling, visual- 
ization, and analysis is envisioned and will set the 
stage for the satellites that follow. An overall system of 
hardware and a set of software "tools, ’’ that will allow 
for both the routine handling of all data and the 
analysis of large data sets assembled by scientists and 
instrument engineers, are currently being developed. 

The software for standard processing and visualiza- 
tion of instrument data is independent of computer 
platform and will allow for easy adaptation from one 
experiment to another. The processing will produce 
data sets that have similar characteristics, allowing 
for easy comparison of data obtained under similar 
circumstances. 

The visualization of both the engineering and scientific 
data is an important part of the system. By creating 
graphical environments for engineering evaluations 
and for scientific analysis data sets can be viewed and 
analyzed rapidly. This rapid analysis of data will 
contribute towards a greater portion of the RAIDS 
data being utilized. 


! SFA, Inc., Landover, MD 

2 Software Technology, Inc., Alexandria, VA 


1. INTRODUCTION 

The Naval Research Laboratory (NRL) is 
currently preparing a number of atmospheric 
imaging instruments to be launched aboard a 
variety of satellites. The first of these projects, the 
Remote Atmospheric and Ionospheric Detection 
System (RAIDS), is due to be launched during 
1 994. Over the course of its lifetime, RAIDS is 
expected to produce a data archive of 220 gigabytes 
or approximately 145 megabytes of compressed 
data per day. For a volume of data such as this, if 
scientists wish to analyze any significant portion of 
the data, methods allowing for processing and 
analyzing large amounts of data quickly are 
required. 

An important part in the development of 
analysis software is the creation of meaningful 
methods of visualizing variations in data over 
longitude, latitude, altitude, and time. The data 
analysis is facilitated by a Graphical User Interface 
( GUI ), using “tools” created specifically to 
process the data from a particular instrument. 
However, users can easily customize the GUI by 
creating new graphical tools and including them in 
a common library. In addition, data sets will be 
analyzed and recorded on video tape or optica! disk, 
allowing rapid searches of large data sets for events 
of interest and permitting subsequent presentations . 

NRL is in the late stages of developing an 
automated and graphical data processing and 
analysis system. This system involves all aspects of 
the experiment, including mission planning, data 
processing, experiment evaluations, simulation, and 
data analysis. By automating the majority of the 
data handling, engineers can quickly diagnose and 
correct problems, while scientists can concentrate 
on data analysis. 
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In the following sections, aspects of the NRL 
system will be explained. The next section will 
describe the flow of data through the system and 
how this flow is to be maintained. Then, a descrip- 
tion of how coordination between the science and 
engineering teams will be achieved. Following 
mission coordination, the ongoing evaluation of the 
instrument and data integrity, and how this will be 
performed, will be d iscussed. The remainder of the 
paper will be devoted to methods of data handling 
and data visualization that will be applied to the 
analysis of the collected data. 

2. MISSION PLANNING AND 

COORDINATION 

During the experiment lifetime, coordination 
between science objectives and engineering con- 
cerns will always be an issue. Specific scientific 
objectives, which should be met, are the coordina- 
tion of upper atmospheric observations with other 
experiments and providing observations of specific 
phenomena at specific times. Along with these 
science objectives, engineers will be concerned 
with ensuring the safety and longevity of the 
instrument, calibration, and instrument sensitivity. 

By using software which can predict satellite 
and experiment parameters, many of the engineer- 
ing and science objectives can be easily met. By 
entering orbital and trajectory information, other 
orbital parameters which are of interest can be 
computed. These parameters can then be viewed 
graphically by the scientist for mission planning 
and the parameters can be saved for later use in 
computing and planning. Using information gener- 


ated by the software, engineers and scientists can 
graphically see where the instrument will be and 
what it will be viewing, allowing them to make 
decisions benefiting both sides. 

As the experiment goes on, the engineers must 
continue to monitor the instrument sensitivities and 
the satellite attitude. This can be accomplished by 
using the background stars that will be contained in 
some of the data. Software will be used to generate 
the spectrum that the instrument will see from these 
stars. These spectra can then be compared over time 
to see how the instrument sensitivity changes over 
time. Attitude can also be monitored by seeing how 
the stars progress through the field of view of the 
instrument. 

3. DATA FLOW AND PROMOTION LEVELS 

The majority of the NRL system is in place to 
process and archive the incoming data. This system 
is written in ANSI - C because of the availability of 
C compilers, the ANSI standard, the extensive 
knowledge of C among the programming commu- 
nity, and the suitability of C to the processes of bit 
shifting and variable size arrays. The purpose of 
this portion of the system is to ingest data from the 
experiment and convert it into machine readable 
format, calculate querying and analysis parameters, 
archive the data, update the data base, and create 
“products” which will facilitate the analysis of the 
data. 

Figure 1 shows the overall processing system. 
The data is received at NRL in a compressed form 
which requires expansion onto standard byte 
boundaries before being considered level 1 data. 
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Figure 1. Diagram showing the flow of data through the system. The arrows go from the processing that is performed to the level 

of da ta that is produced. 
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This data is verified to be in a consistent format to 
ensure proper operation of the later stages of the 
processing. 

The next stage is the calculation of attributes 
from the information contained in the expanded 
data stream. There are a total of 64 attributes which 
consist of information, such as altitudes, attitudes, 
angles, and coordinates in different systems. The 
software itself consists of individual subroutines, 
written in C, and their calling routines that will 
compute the information. The calling routines will 
check the data and call the subroutines to perform 
the computation of the data. The computation of the 
attributes will be completed before the next stage of 
the processing system begins. 

At this point, processing will stop and all of the 
information will be archived. Optical disks have 
been chosen as the archive medium for their storage 
capacity per disk. After the data is archived, it 
remains online for thirty days. The archive medium 
is then duplicated so that risk of losing data is 
reduced as low as possible. The duplicates will be 
used after the data set goes offline to support 
requests for back data. As the archive is created, all 
of the attributes that were calculated are used to 
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create a relational data base that can be used to 
obtain subsets of data that meet particular criteria. 
Criteria for subsets can involve any of the 64 
attributes that are calculated plus additional infor- 
mation that is generated directly by the instrument. 
The purpose is to allow the user to obtain data 
directly and quickly. An example of the querying 
screen that will be used is shown in Figure 2. 

After the integration of the day’s data into the 
data base, images of the instrument data and 
environmental parameters are constructed. Parts of 
the instrument data are stripped from the level 1 
data and converted into scientific units, producing 
level 2 spectral data, and the environmental param- 
eters are computed from these level 2 images. 

These images are produced in a standard format 
which will allow for easy viewing by scientists 
without an extensive knowledge of the format and 
requiring little or no pre-provided software. This 
part of the system scans through the data for 
information that matches the search criteria and 
constructs images out of all the information that 
meets this criteria. The output from this part of the 
system is two data files. One contains the actual 
image of the remote sensing data, while the other 
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Figure 2. Sample querying screen from the RAIDS data base. 
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contains the information from the data stream that 
will be used to analyze this data. Attributes are not 
passed with this image. The information required to 
compute the attributes is passed with the image and 
the routines for the computing of the attributes are 
provided to all the users. This is done to reduce the 
amount of redundant data that is to be stored. This 
data will then be archived to magnetic tape and be 
integrated into the data base as auxiliary data 
available from a query. 

Another i mportant component of the data 
analysis will be the use of video. By using video, 
new approaches to data analysis can be developed 
which will allow individuals to view and analyze 
time variations in data that would otherwise be 
difficult to see. Video also allows for easy transport 
of the data for viewing purposes between locations. 
This will allow presentations to showcase the actual 
data that was used for the analysis instead of just a 
single frame or two. 
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4. ENGINEERING VISUALIZATION 

During the experiment, constant monitoring is 
necessary. This monitoring will focus on tempera- 
tures and voltages on the experiment. If any of 
these values leave their nominal ranges, a diagnosis 
and remedial decision must be made. Because of 
the large amount of information that must be 
viewed for proper evaluation of the instrument 
status, methods to allow for easy viewing and 
monitoring of this information are required. 

Two distinct methods have been developed to 
handle the long-term viewing and the monitoring of 
this information. At the lowest level, this will 
consist of routines that will check each value from 
the instrument against a nominal range, and if any 
values occur outside this nominal range, it will be 
reported. By using this tool, engineers will be 
alerted to situations where the experiment is behav- 
ing abnormally. By screening all incoming data, 
constant supervision is unnecessary. 
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Figure 3 . Sample screen of Engineering Station. Shown are the menus for selecting various components to view and the two 
windows in which they can be viewed. 
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Accompanying the continual screening is a 
graphical engineering station. Once a problem has 
been identified, engineers must be able to examine 
the health of the experiment around the time of the 
event. By using this station, engineers can view 
either the history of a particular instrument or single 
values from different instruments simultaneously. 
With this capability, engineers will be able to 
quickly diagnose specifically where the problem lies 
and formulate a solution. 

The engineering station, shown in Figure 3, was 
designed using the IDL X-Windows widget interface 
library. The station is operated almost entirely by 
pull down menus, with keyboard interaction being 
limited to entering frame numbers for viewing. The 
minimization of keyboard interaction reduces the 
amount of time that the user must spend in the 
initialization of the station to view data, and also 
reduces the probability of crashing the system. 

By selecting from various, self-explanatory 
options under various menus, the user sets up the 
particular viewing conditions he wants. At this 
point, he is presented with plot windows for the 
display of the particular data and a set of scrolling 
lists to allow for the selection of the information he 
wants to view. Upon selecting an item from one of 
the scrolling lists, the information he requested to 
view is displayed on the drawing area. By looking at 
the various items on the experiment, an engineer can 
diagnose the overall condition of the experiment. 

5. VISUAL DATA ANALYSIS TOOLS 

With the large volume of data that the RAIDS 
will generate, new ideas about data analysis must be 
used. Previously, it was not uncommon to store data 
on tape and forget about it, wasting data. NRL has 
developed visual data analysis tools (VDATS) that 
will allow easy performance of data analysis, saving 
time and allowing more data to be analyzed. The 
idea is to provide users with a graphical interface 
that pulls together all of the various aspects of data 
analysis into one package. These tools are such 
things as inversion models, simulators, smoothing 
functions, FFTs, overlays, color patterns, and 
various graphing and plotting routines. This will 
allow the user to do those things that he normally 
does to data, within the confines of the interface. If 
the user desires something different, the data can be 
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analyzed interactively using his own methods and 
customized versions of the software. 

In addition to compiling all of the common 
analysis tools, the VDATs are also easily expand- 
able. This expandability will allow the system to 
evolve over time to fit nicely into the data analysis 
process. This expansion would occur when a 
scientist issues a request to the governing science 
team and the team approves the enhancement of the 
system. This process ensures that only those ideas 
which a majority of users feel would be useful will 
be added. 

After initial data analysis, the user may feel the 
need to perform certain actions non-interactively, 
then return later and view the result. While the user 
is doing the initial data analysis, the VDATs can 
optionally record all of the actions into a file. This 
file will then be able to be submitted non-interac- 
tively on another data set, allowing easy construc- 
tion of batch jobs. 

As users log on to the VDAT system, they will 
be placed into the VDAT environment. This envi- 
ronment is a window driven, graphical user inter- 
face. The first thing that they will want to do is 
open a data set and look at the raw, unprocessed 
data. This is done by choosing ‘OPEN' under the 
‘FILE’ selection. After looking at the file, they may 
want to do some smoothing, change the color table, 
plot a horizontal or vertical slice of the image, run 
an FFT on the data, or any number of things. These 
will all be activated by choosing the appropriate 
selection from the menu or sub-menus. At any time 
during the course of this process, the user can select 
to begin recording the actions for later use by 
pressing ‘LOG’ under the ‘FILE’ selection. As this 
is chosen, all actions that are performed while in 
VDATs will be recorded into a file. At the termina- 
tion of the program, or when the user stops it, 
recording will stop. The user can then either con- 
tinue on, or can save the image that he has created 
and exit the VDATs. 

6. APPLICATIONS OF VIDEO RECORDING 

MEDIA 

Another component in the RAIDS system is the 
standard recording of images and information onto 
videotape. With the abundance of videotape machines 
in the world, scientists from NRL will be able to take 
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Figure 4. Sample screen showing a model interface. The input parameters to the model are entered into the grey boxes. The 
background is the main area for the visual data analysis tools. 



Figure 5. Sample screen showing the image slicing tool. By clicking on a point in the image contained in the main window the row 
or column that was selected will be plotted in the smaller windows. 
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data from the laboratory and display the results at 
professional meetings. The second appealing feature of 
video recording is the ability to scan or compare a 
sequence of images for the purpose of identifying 
interesting, dynamical, or qualitative features. Once a 
feature is designated for detailed analysis, the original 
data can be accessed for the frame in which the 
phenomenon appeared. Screening techniques like this 
are essential to retrieving maximum scientific informa- 
tion from large data sets. 

The process of recording onto videotape is a 
two step process. The first part is to record the 
images as they are displayed by the computer on a 
monitor. This monitor is connected to a “Write 
Once Read Many (WORM)” optical disc recorder 
which scans the monitor and can record up to 60 
frames per second. The disk can be archived and 
used as often as necessary to either make new 
videotapes or for the purposes of previewing data. 
From the optical disk, the images can be spliced 
together either in or out of sequence to make a 
movie. This is then recorded onto video tape for 
showing at some later time. With the recording onto 
videotape comes the option of adding or accenting 
features to make a more useful product. Currently, 
NRL is acquiring the ability to do video editing of 
the images, constructing title screens, and even 
adding audio editing capabilities. 

7. SUMMARY 

The system described in this paper is a general 
system that is in place for a general type of instru- 
ment. With each new instrument, the same system 
will be used with modifications in the modular 
processing software. Through automated processing 
and visual data analysis, a much larger fraction of 
the data from the satellites will be able to be used. 
Instead of creating a new system every time an 
instrument is created, “front-ends” will be created 
that will push the data into the standard format. 

With the concepts of data visualization and the 
analysis methods being so new, this system will 
undergo numerous upgrades. As users begin to gain 
comfort with it, new ideas will begin. Because the 
system does not attempt to do everything from the 
start, while allowing for easy expandability, the 
system will be able to grow with the analysis 
community through the years. 
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Critical problems in planning coordinated observa- 
tion campaigns for magneto spheric science include 
the need to predict time intervals when one or more 
observing satellites or ground stations will be con- 
nected along magnetic field lines to other observation 
sites, or when such sites will be located within mag- 
netospheric regions of common interest. The Satellite 
Situation Center (SSC) was created at the National 
Space Science Data Center ( NSSDC) during the Inter- 
national Magnetospheric Study in the 1970s to address 
these problems. The SSC Data System has evolved 
since that era to support potentially complex queries 
by SS C staff and has now been opened to NASA Science 
Internet access via the NSSDC On-line Data Informa- 
tion System (NOD IS). The SSC software, ephemeris 
data base, and access modes are described for the 
Version 2.1 release in 1993. 


1 Space Physics Data Facility, NASA Goddard Space Flight 
Center 

2 National Space Science Data Center, NASA Goddard Space 
Flight Center 

3ISTP Science Planning and Operations Facility, NASA 
Goddard Space Flight Center 

technology Applications Group, Hughes STX Corporation 


1. INTRODUCTION 

The Satellite Situation Center (SSC) is a unit of 
the National Space Science Data Center (NSSDC) 
and NSSDC’ s international counterpart, World Data 
Center A for Rockets and Satellites (WDC-A- 
R&S). SSC personnel and software systems support 
NASA and international space physics activities by 
maintaining an ephemeris data base for scientific 
satellites in geocentric or heliocentric orbits, which 
can be used to plan and support analysis of coordi- 
nated science observations by multiple satellites. 
NSSDC and the Space Physics Data Facility 
(SPDF), which now provides software development 
support for SSC, are jointly managed by the Space 
Science Data Operations Office (SSDOO) at NAS A 
Goddard Space Flight Center with primary contrac- 
tor support from Hughes STX Corporation. 

The SSC was established in the mid- 1 97 Os to 
support and coordinate multi-mission planning for 
the International Magnetospheric Study (IMS) 
[Teague et al., 1982], SSC software resources from 
the IMS era continued to be used in planning for 
missions, such as Dynamics Explorer 1 and 2 (DE 
1 2), the International Sun-Earth Explorer series 
(ISEE 1 , 2, and 3), and the Interplanetary Monitor- 
ing Platform series (IMP 7 and 8). The SSC sup- 
ported the ongoing series of Coordinated Data 
Analysis Workshop studies that began in 1 978. In 
1986 SSC played a major planning and coordina- 
tion role during the multi-mission Polar Region and 
Outer Magnetospheric International Studies 
(PROMIS) program. Later, SSC similarly supported 
ajoint mission of NASA (U.S.A.), IKI (Russia), 
and ISAS (Japan) during 1989-90 for coordinated 
observations with IKI’ s Active satellite under the 
aegis of the Inter-Agency Consultative Group 
(IACG). 
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Among the projects now supported by SSC are 
the Solar Terrestrial Energy Program (STEP), 

NAS A’ s Global Geospace Science (GGS) program, 
the International Solar Terrestrial Physics (ISTP) 
program, and the International Heliospheric Study 
(IHS ). SSC data and software have recently been 
ported to the ISTP/GGS Science Planning and 
Operations Facility (SPOF) at NASA Goddard 
Space Flight Center. Precomputed state vector 
information, predictive or definitive, for the Geotail 
and WIND spacecraft is being provided by SSC to 
the IACG-WG3 (Working Group 3) satellite 
facility, called SPIN, at ISAS. Similar data will be 
provided to the Joint Science Operations Centre 
(JSQC) in the United Kingdom Rutherford- 
Appleton Laboratory for the planned Cluster 
mission and at the Experimenters Operations 
Facility (EOF) at GSFC for the SOHO mission. 
Other space physics data systems, including the 
new Geospace Environmental Data Display System 
(GEDDS) at Poker Flat Research Range [Akasofu 
et ah, 1 992], which integrates models and ground 
and satellite observations of the auroral ionosphere 
for studies of magnetosphere-ionosphere coupling, 
may be among future users of SSC data. 

2. SSC OPERATIONS AND FACILITIES 

Updated orbital elements for many satellites are 
routinely received by SSC on magnetic tape (three 
per week) from the United States Space Command 
(USSPACECOM), previously known as NORAD. 
Elements for satellites of interest to NASA sup- 
ported missions and international programs are 
processed by SSC staff into time-ordered Cartesian 
(X- Y-Z) coordinates, by using the Goddard Trajec- 
tory Determination System (GTDS) code main- 
tained at NASA Goddard’s Flight Dynamics 
Division, and stored in a data base within the SSC 
Software System in NSSDC’s Common Data 
Format (CDF) [Treinish and Gough, 1987], The 
Cartesian data points are stored at maximum time 
resolution of one minute in geocentric inertial 
coordinates. The orbital element data can be ac- 
cessed electronically through NASA Science 
Internet (NSI) at an anonymous FTP directory 
called ACTIVE. The Cartesian data are available 
either via the SSC Data System, which is accessed 
through NSSDC’ s On-Line Data Information 


Service (NODIS), or through a ported version of the 
data system software on a SUN workstation at the 
user’s home facility. 

SSC is supported by software designed to 
answer common queries about time periods during 
which one or more satellites and/or ground stations 
may be in conjunction on the same magnetic field 
line or in the same magnetospheric region as 
determined by comparison of spacecraft position 
and of field line footpoint locations in the iono- 
sphere. The software supports a wide range of user- 
selected options for internal and external magnetic 
field models, field line traces, and output coordinate 
systems. Other capabilities include data listings of 
satellite coordinates in one of several coordinate 
systems and identification of magnetospheric region 
at each point along the satellite track. Calculator 
functions support conversions of footpoint and 
satellite locations between different coordinate 
systems. 

The software and hardware capabilities of SSC 
have evolved over many years since the first 
generation of SSC programs was written in FOR- 
TRAN to run on a MODCOMP IV/25 computer for 
production of simple reports and data listings. In 
1975, an interactive graphics system was added for 
preconfigured plots of key orbital parameters. In 
1985, the software was ported to a MODCOMP 
Classic 11/45 computer after previous updates and 
additions in 1980 in support of the Dynamics 
Explorer mission. More recent upgrades have 
included ports to more powerful computers in the 
VAX/VMS and SUN/UNIX environments, the 
addition of new magnetic field models, and the 
revision of definitions for magnetospheric regions 
[Parthasarathy etal., 1992]. 

Whereas user queries were exclusively handled 
by SSC staff prior to spring 1 993, the emergence of 
the SSC Data System into the NSI network 
environment now makes possible direct access to 
SSC software and data by the space science 
community. The current Version 2.1 release of the 
SSC Software System, the software component of 
the data system, has a modern user interface 
supporting access by X-Windows (XI 1R5 
compatible) and VT- 1 00 compatible terminals, as 
well as a three-dimensional graphics capability for 
X/PEX environments supporting the PHIGS 
(Programmer’ s Hierarchical Interactive Graphics 
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Table 1. Time resolution and coverage of satellite data in the SSC Data System (as of August 1993). 


End Coverage Date 


Satellite 

Time 

Resolution 

Definitive/ 

Predictive 

Start Coverage Date 

End 

Name 

(Seconds) 

(D/P) 

YYYY 

DDD 

H.H 

YYYY 

ACTIVE 

60 

D 

1989 

272 

0.0 

1991 

AKEBONO 

60 

D 

1989 

54 

0.0 

1993 

APEX 

60 

D 

1991 

351 

0.0 

1993 

CCE 

60 

D 

1984 

230 

0.0 

1988 

CRRES 

60 

D 

1990 

206 

23.0 

1991 


60 

P 

1991 

277 

8.0 

1992 

DEI 

60 

D 

1981 

215 

10.6 

1991 

DE2 

60 

D 

1981 

216 

0.0 

1983 

DMSP-F10 

60 

D 

1990 

336 

0.0 

1993 

DMSP-F1 1 

60 

D 

1991 

333 

0.0 

1993 

DMSP-F8 

60 

D 

1990 

1 

0.0 

1993 

DMSP-F9 

60 

D 

1990 

1 

0.0 

1992 

FAST 

60 

P 

1994 

236 

0.0 

1995 

FREJA 

60 

D 

1992 

282 

0.0 

1993 

GEOTAIL 

720 

D 

1992 

208 

0.0 

1993 


720 

D 

1993 

259 

8.0 

1995 

GMS 3 

60 

D 

1986 

84 

0.0 

1986 

IMP 7 

720 

D 

1972 

270 

0.0 

1978 

IMP 8 

720 

D 

1973 

303 

0.0 

1996 

INTERBALL 

(AURORA) 

60 

P 

1994 

79 

0.0 

1996 

INTERBALL 

(TAIL) 

720 

P 

1993 

342 

0.0 

1996 

IRM 

720 

D 

1984 

256 

0.0 

1986 

ISEE 1 

720 

D 

1977 

296 

0.0 

1987 

MOON 

1728 

P 

1991 

182 

0.0 

1993 


1728 

P 

1993 

182 

0.0 

1996 

NO A A 12 

60 

D 

1991 

136 

0.0 

1993 

OGO 6 

60 

D 

1969 

156 

0.0 

1970 

OHZORA 

60 

D 

1984 

51 

0.0 

1986 

POLAR 

720 

P 

1994 

141 

12.6 

1996 

RELICT-2 

1728 

D 

1993 

349 

0.0 

1995 

SAMPEX 

60 

D 

1992 

186 

0.0 

1993 

SCATHA 

60 

D 

1979 

36 

0.0 

1991 

SOLAR-A 

(YOHKOH) 

60 

D 

1991 

248 

0.0 

1993 

UARS 

60 

D 

1991 

263 

0.0 

1993 

VIKING 

60 

D 

1986 

64 

0.0 

1987 

WIND 

720 

P 

1994 

75 

0.0 

1995 
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System) interface. X/PEX and PHIGS are available 
from the MIT X Consortium free of any license 
fees. The user interface was constructed using the 
JY ACC Applications Manager (JAM) , which 
provides a user-friendly menu-driven interface 
allowing the user to move quickly between different 
system components and to easily specify run-time 
parameters and output options. 

3. DATA SYSTEM DESCRIPTION 

3.1 User Access 

The SSC Data System may be accessed 
over NSI through DECnet or Internet network 
protocols. The first step is to log onto the 
NSSDC account nssdca: :nodis on DECnet or 
nodis@nssdca.gsfc.nasa.gov on Internet. A pass- 
word is not required in either case. After responding 
to initial login prompts (type “Y” for the new 
NODIS interface), the user selects the “Space 
Physics” option from a menu of major disciplines 
and then the “Satellite Situation Center” option 
from the space physics submenu. The user is 
prompted for a choice of terminal protocols includ- 
ing VT- 100, Macintosh with Versaterm Pro (VT/ 
Tek 4 1 05 emulation), Tektronix 4105, SUN 
Shelltool, and X-term. 

3.2 Main Menu 

The initial menu of the SSC Data System offers 
the following options: (1) Query, (2) Locator, (3) 
Calculator, (4) Database Information, (5) File 
Processing, (6) Usage Notes, and (7) Exit. Query 
invokes the Query Processor menu for potentially 
complex queries regarding field line and/or mag- 
netospheric region conjunctions of multiple satel- 
lites and ground stations. Locator invokes the 
Ephemeris Locator, which provides sequential 
listings in selectable Earth-centered coordinates of 
satellite position and region versus time, as well as 
an additional three-dimensional plot option (SUN- 
Windows terminals only) for orbital tracks in 
Geocentric Solar Ecliptic (GSE) coordinates. The 
Calculator option invokes the Coordinate Calculator 
for conversion of coordinates between several 
geocentric systems. Database Information gives the 
current list of satellites, time resolution, ephemeris 


type (usually predictive for pre-launch and defini- 
tive for post-launch), and associated date/time 
coverages. The current list, as of August 1993, is 
shown in Table 1 . The user should note, however, 
that older ephemeris data sets are now kept off-line, 
and a special request may be made to bring such 
data on-line. (Output) File Processing supports the 
viewing, deletion, printing, and FTP file transfer to 
the user’s home node of the contents of the working 
directory, including output files from previous runs 
of the various SSC programs. Usage Notes provide 
on-line help information. Exit returns the user to the 
NODIS account. 

3.3 Query Processor 

The main window for this processor is shown in 
Figure 1 for the SUN-Windows access option. For 
selection of one or more satellites and a start/stop 
time range to meet the specified criteria, a set of 
conditions on the ephemeris data query can be set 
up from this window with constraints on the mag- 
netospheric regions (Region Setup) and magnetic 
field line traces (Trace Setup). The user can specify 
up to nine sets of conditions for any one query, and 
a single query may require the current condition, 
any other condition, or all of the conditions. 

The region option enables listing of times 
during which individual satellites are in one of the 
selected magnetospheric regions. As shown in 
Figure 2, the Region Selection Window allows 
selection of all or only a few regions. In the cases of 
the dayside/nightside magnetosphere and 
plasmasphere regions and sub-regions (north, south, 
cusp, cleft, auroral oval, and polar cap within the 
magnetosphere), the region choice may exclude one 
or more sub-regions . 

Figure 3 shows an artistic view of the principal 
region locations in the Earth’ s magnetosphere. 
Analytic representations of the region boundaries 
are set within the software and have been deter- 
mined from scientific literature and from cumula- 
tive experience of, and continuing consultation 
among, scientific staff of SSC, the ISTP/GGS 
SPOF, IACG, STEP, and the scientific community. 
Pending consensus redefinitions of analytic forms 
for these boundaries, the region parameters will be 
updated from those defined for the Version 2. 1 SSC 
software i n Parthasarathy et al . [ 1 994] . 
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Figure I. X-Windows version of initial input window for Query Processor in the SSC Data System. 



Figure 2. X-Windows version of Region Selection window for Query Processor. 
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Figure 3. Artistic rendition of regions and boundaries in the terrestrial magnetosphere ( not to scale). 


The trace option enables field line tracing 
queries to identify either periods when one or more 
satellites are on the same magnetic flux tube as a 
specified lead satellite, or when one or more 
satellites occupy a field line tracing down near a 
specified ground station. The flux tube test is done 
within the specified time range at each orbital point 
(minimum time separation is 60 seconds) by tracing 
the local field line of each satellite down to a 
footpoint at a specified altitude above ground level 
and by calculating whether that footpoint is within a 
specified latitude-longitude interval or great circle 
distance of the specified lead satellite or ground 
station. Trace options include selectable models for 
calculation of internal and external magnetic field 
components at each trace point and selectable 
direction of trace to determine footpoints in the 
same, opposite, or both north-south hemispheres. 


Magnetic field models now available at NSSDC 
for the terrestrial magnetosphere have been re- 
viewed by Bilitza [ 1 992] . Internal models supported 
by Version 2. 1 of the SSC Data System include 
IGRF (Epoch 1965, 1975, 1980), MAGS AT (Epoch 
1980), Barraclough (Epoch 1975), and a centered 
dipole. External models include Tsyganenko 1987 
(select long or short tail, warped plasma sheet, 

Stern variation), Tsyganenko 1989 (select AE or Kp 
Long), Mead-Fairfield (select superquiet, quiet, 
disturbed, or super-disturbed), and Olson-Pfitzer 
1 974 (select tilt or no tilt). Model choices will be 
updated in the future. 

3.4 Ephemeris Locator 

The Locator supports options to display satellite 
location data in tabular or graphical format. In the 
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tabular format the user may request the location in a 
variety of coordinate systems and with a variety of 
corresponding parameters shown in the VT- 1 00 
window for tabular output options in Figure 4. The 
following coordinate systems are defined as in 
Russell [1971]: GEI - Geocentric Equatorial 
Inertial, GSE - Geocentric Solar Ecliptic, GSM - 
Geocentric Solar Magnetospheric, SM - Solar 
Magnetic, GEO - Geographic, and GM - 
Geomagnetic. The tabular output may be filtered by 
ranges specified for one or more satellite location 
parameters. 

Panels (a) and (b) of Figure 5 show samples of 
graphical three-dimensional and two-dimensional 
output in GSE coordinates, the X axis being di- 
rected towards the Sun, the Z axis being directed 
northward from and perpendicular to the solar 
ecliptic plane, and tick marks being given along the 
axis in units of Earth radii. In the two-dimensional 
plot (Figure 5b), the conical shape is the X-Z GSE 


projection of the Sibeck magnetopause boundary 
[Sibecketal., 1991] superimposed on the forty day 
tracks of Geotail, IMP 8, and WIND, based on 
predictive ephemeris data for 1994. The choice of 
view direction to give two-dimensional or three- 
dimensional plots is arbitrary. 

3.5 User Support 

The online Usage Notes provide some guidance 
in utilization of the SSC Data System menus, 
windows, and special keys. A user’ s guide is 
available upon request from NSSDC’ s Coordinated 
Request and User Support Office (CRUSO). The 
regular mail address is CRUSO, National Space 
Science Data Center, Code 633.4, NASA Goddard 
Space Flight Center, Greenbelt, MD 2077 1 , US A. 
CRUSO can also be reached by phone at (301)286- 
6695, Fax at (301)286-1771, or by NSI e-mail at 
request@nssdca.gsfc.nasa.gov or nssdca: request. 
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Figure 4. VT-100 version of Tabular Output Options window for Ephemeris Locator. 
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Figure 5. Graphical outputs from the Ephemeris Locator in GSE coordinates with (left) three-dimensional and (right) two- 

dimensional views. 


4. OTHER RELATED DATA SYSTEMS AT 

NSSDC 

The NODIS account contains several other 
options which the user may wish to investigate as 
complementary to the capabilities offered by the 
SSC Data System. Under the “Multidisciplinary” 
category in the initial NODIS menu, one can select 
the NAS A Master Directory for information on 
NASA data systems and mission data sets and the 
NSS DC Master Catalog for detailed information on 
spacecraft missions, experiments, and NSSDC-held 
data sets. In that same category, there are further 
options for information about access to data sets 
either directly online through NSSDC’s Anony- 
mous FTP account or near-line through the NSSDC 
Data Archive and Distribution System (NDADS). 
The ACTIVE directory on the Anonymous FTP 
account offers frequently updated listings of orbital 
elements and precalculated field line conjunction 
events for active scientific spacecraft of interest to 
the magnetospheric physics community. The 


NODIS “Space Physics” category offers direct 
access to the OMNI data base for near-Earth (e.g., 
IMP satellite series) interplanetary magnetic field, 
solar wind plasma, energetic particle, geomagnetic 
index, and solar activity data, and to a description 
of the COHO (Coordinated Heliospheric Observa- 
tions) data base for deep space interplanetary data 
from missions, such as Pioneer 1 0 and 1 1 , V oyager 
1 and 2, Helios 1 and 2, and Pioneer Venus Orbiter. 
Heliocentric ephemeris data for such missions, and 
for major planets, are also available through the 
ACTIVE and COHO directories on Anonymous. 
Such data are not currently supported by Version 
2. 1 of the SSC Data System interface, but may be in 
future releases. 
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We report on the development of an interactive system 
for visualizing and analyzing numerical simulation 
results. This system is based on visualization modules 
which use the Application Visualization System (AVS) 
and the NCAR graphics packages. Examples from 
recent simulations are presented to illustrate how 
these modules can be used for displaying and 
manipulating simulation results to facilitate their 
comparison with phenomenological model results 
and observations. 


1. INTRODUCTION 

With the advent of multispacecraft missions, 
such as the NASA Global Geospace Mission [e.g., 
Russell, 1 994], it has become obvious that theoreti- 
cal models are needed, not only to interpret local 
spacecraft observations, but also to link single point 
measurements. To these ends, mission theorists 
have been developing models that address space 
plasma phenomena on local and global scales, 
involve nonlinear processes, and use self-consistent 
approaches. Because of the complexity of these 
models, it has become standard to use numerical 
simulation techniques. 

In the past, numerical simulations have gener- 
ally been used to provide an indepth physical 
understanding of phenomena that had first been 
interpreted in terms of simple models and linear 
theory. As such, simulations came at the end of the 
intellectual process of interpreting spacecraft 
measurements, rather than at the beginning. From 
its inception, the International Solar Terrestrial 
Physics (ISTP) program coordinators recognized 
that it was necessary to develop the ability to use 
simulations at an earlier stage of the mission for use 
in the planning process. An additional challenge 
was to increase the realism of the simulation 
models so their results could provide quantitative 
predictions that could be effectively compared to 
specific spacecraft observations. 

In response to these challenges, ISTP theoreti- 
cal teams have been producing more and more 
sophisticated simulations by refining algorithms, as 
well as improving the temporal and spatial resolu- 
tion. The large volume of information produced by 
these numerical simulations has presented new 
problems in data management and analysis. The 
recent advances in both hardware and software 
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graphics technology have not only considerably 
helped to alleviate these problems but also have 
opened new and exciting possibilities. Visualiza- 
tion techniques have become invaluable, not only 
for displaying numerical simulation results, but 
more importantly for analyzing the implications of 
these results. 

2. NUMERICAL SIMULATION DATA SETS 

2.1 Simulation Techniques 

Since the 1 960s, numerical simulations of plasmas 
have been used extensively to study a vast array of 
physical problems, ranging from laboratory plasmas to 
interstellar plasmas. Numerous simulation techniques 
have been developed which depend on both the 
temporal and the spatial scales of the plasma phenom- 
ena investigated. 

One can distinguish two categories of approaches 
frequently used to model the time-dependent, 
collisionless plasmas normally encountered in space. 
The first class is made up of “kinetic” simulations. 
These simulations are the equivalent of following the 
evolution of the Vlasov equation together with 
Maxwell’ s equations by calculating the orbits of a 
large number of finite-sized particles in their self- 
consistent electromagnetic fields. There are many 
varieties of particle codes, including electrostatic, 
magnetostatic, and electromagnetic models [e.g., 
Hockney and Eastwood, 1981; Birdsall and Langdon, 
1985] . Since particle codes give a full description of 
the local plasma physics, they have been widely used 
to investigate the plasma instabilities generated by 
wave-particle interactions that occur in many regions 
of the magnetosphere. There are many additional 
particle simulation approaches, such as methods which 
implicitly integrate particle motion [e.g., Brackbill and 
Forslund, 1982], methods which retain only particles’ 
slow drifts by averaging over their more rapid 
gyromotion [e.g., Lee, 1987], and “hybrid” methods 
where one of the species (usually the electrons) is 
described as a fluid [e.g., Winske, 1985], 

At the other end of the spectrum are the fluid 
simulation models. Although collisions are infre- 
quent, correlation times are small enough to use 
magnetohydrodynamic (MHD) simulations to 
model large scale processes. These codes self- 
consistently solve a closed system of Maxwell’s 
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equations and ideal fluid equations as an initial and 
boundary-value problem. They have been used 
quite successfully to model local phenomena, as 
well as the global interaction of the solar wind with 
Earth’s magnetosphere [e.g., Oginoetal., 1986; 
Fedder and Lyon, 1987]. Fluid models based on 
higher moment transport equations are also being 
used to describe non-equilibrium and multifluid 
processes, which are particularly important in 
ionosphere-magnetosphere coupling and cometary 
mass loading [e.g., Schunk, 1977; Gombosi and 
Korosmezey, 1989]. 

MHD simulations have inherent limitations in 
rendering kinetic effects, such as gradient and 
curvature drifts. To evaluate the importance of 
kinetic effects in a large system other approaches 
have been used. One of them is based on the 
numerical integration of particle trajectories in a 
given set of electromagnetic fields and involves 
calculating physical quantities by deriving moments 
obtained from the particles’ distribution functions. 
Although the computed distributions are not neces- 
sarily consistent with the electromagnetic fields, 
these “large-scale kinetic” simulations have a 
distinct advantage because they can be used with 
analytical, as well as data-constrained models. 

They have been very useful in investigating the 
macroscopic signatures of kinetic processes, such as 
stochastic scattering in the magnetotail [e.g., 
Ashour-Abdallaetal., 1993]. 

The simulation techniques discussed above 
have been implemented in many different ways, 
depending on the specific area of magnetospheric 
physics to which each was applied. Details of the 
simulation algorithms currently being used in space 
plasma physics can be found in the proceedings 
from the International School for Space Simulations 
[Matsumoto, 1982; Ashour-Abdalla and Dutton, 

1 985 ;Lembege and Eastwood, 1988; Matsumoto 
andOmura, 1991]. 

2.2 Simulation Diagnostics 

A great variety of diagnostics is used to analyze 
simulation results. Most of the time these diagnos- 
tics involve multi-dimensional space and are 
presented graphically. Although a large fraction of 
them are generic, such as contour plots of electro- 
magnetic fields, densities, etc., some diagnostics are 
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more specific to the technique used to carry out the 
simulation. For example, diagnostics of most 
kinetic simulations include phase space representa- 
tions of particle distributions and wave spectral 
analyses to identify sources of free energy and 
wave-particle interactions. In fluid-type simula- 
tions, stronger emphasis is given to diagnosing 
topological properties of flows and electromagnetic 
fields by tracing, for example, stream lines and 
magnetic field lines. 

The main difficulty is not in calculating these 
diagnostics, but rather in implementing their 
computation with as little overhead as possible on 
the time-step cycle. This depends of course on the 
performance and capabilities of the computers and 
peripherals used, in particular the input/output 
speeds and the internal and external mass-storage 
capacities. In early numerical simulations, digital 
output was held to a minimum. Almost all diagnos- 
tics were embedded in the code because of slow 
output speeds and the rapid saturation of storage 
space. This limited the analysis that could be 
performed on the simulation results since diagnos- 
tics had to be implemented before the run. Recent 
progress in computer technology has considerably 
reduced these bottlenecks. Today most of the 
diagnostic routines can be dissociated from the 
simulations to speed up the execution time. Large 
data sets can be extracted from the computation and 


stored to be post-processed either as the computa- 
tion is executed or later on. As we will see later, 
the average size of an output file from current 
three-dimensional MHD simulations is of the order 
of 16 MB per snapshot in time. Files of this size 
can rapidly add up to massive data sets (1GB per 
simulated hour when following the evolution of the 
magnetosphere with a time resolution of one 
minute). 

2.3 Display of Simulation Results 

Despite the massive processing capabilities of 
modern computers, space research is still funda- 
mentally dependent on the investigator' s ability to 
visually recognize patterns in the data. Experimen- 
talists have invested a tremendous amount of effort 
in creating efficient displays for the meaningful 
representation of large data sets. It has long been 
recognized that to be effective, simulation results 
should, to the greatest extent possible, be presented 
in formats that parallel the data displays used for 
spacecraft measurements [e.g., Ashour-Abdaila et 
al., 1994] to help assess whether theoretical predic- 
tions are consistent with spacecraft measurements. 

We have already started to produce simulation 
results in an experimental type of format. Figure 1 
shows results of a two-dimensional electrostatic 
simulation of an ion beam-driven system which was 


Electron Cyclotron Harmonics 



Figure I. Results of a two-dimensional electrostatic simulation showing wave diagnostics displayed in formats similar to those 
used to plot measurements from wave experiments onboard spacecraft. The right panel exhibits frequency-wave number 
dispersion relations color coded according to wave power. 
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designed to model the generation of broadband 
electrostatic noise (BEN) observed in the plasma 
sheet boundary layer [Gurnettet al., 1976], Al- 
though BEN is thought to result from the electron 
acoustic and ion-ion instabilities generated by field- 
aligned beams [e.g., Schriver and Ashour-Abdalla, 
1990], we display here only the part of the wave 
analysis that indicates that electron harmonic 
cyclotron (ECH) waves can be simultaneously 
enhanced during the process [Berchemetal., 1991]. 

The left and center panels of Figure 1 show the 
results from the simulation displayed in formats 
similar to those used to plot Sweep Frequency 
Receiver (SFR) spectrograms and power spectra 
obtained from wave experiments onboard 
spacecraft. The right panel exhibits frequency- 
wave number dispersion relations color-coded 
according to wave power. Although wave 
instruments do not obtain a direct measurement of 
wave numbers, this plot shows how we can exploit 
the extra information produced by simulations. By 
comparing this diagram to linear wave theory we 
can clearly recognize the classical dispersion 
relation expected for ECH waves and positively 
identify the peaks observed in the spectrogram and 
the power spectrum. 

Recent advances in both hardware and software 
have provided new possibilities for analyzing 
simulation results other than simply duplicating 
experimentalists’ formats. As we will see in the 
next section, it is now relatively straightforward to 
design and implement interactive visualization 
environments where simulation results can be 
analyzed in the context of observations to realize 
synergistically the goals of mission-oriented theory. 

3. THE UCLA MISSION-ORIENTED 

VISUALIZATION SYSTEM 

3. 1 The Development of the Mission-Oriented 

Visualization System 

The UCLA Mission-Oriented Visualization 
Systemis aninteractive post-processor of simulation 
results that uses the commercial Application 
Visualization System (AVS) software for three- 
dimensional volume rendering and the N CAR graphics 
package for two-dimensional plots. AVS isamodular 
programming environment based on a data-flow 
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network architecture. Originally developed in 1989 by 
the Stardent Computer Company, this software allows 
users to build visualization networks by simply 
connecting individual modules using mouse-driven 
point-and-click operations. These modules include a 
variety of data-input, filtering, mapping, and rendering 
output routines that provide a comprehensive set of 
visualization tools. In addition to providing an 
intuitive and easy-to-use interface for configuring, 
modifying, and saving networks, the AV S architecture 
lets the user create custom modules to control any 
aspect of A V S . This capability provides the true 
power of the AV S package. Being able to write and 
incorporate modules has allowed us to design our own 
application system to take full advantage of the A V S 
generic visualization environment while meeting our 
needs in displaying and analyzing space plasma 
simulation data sets. This task can be greatly 
facilitated by using the AV S module generator that 
creates skeletons into which users can insert specific 
algorithms. 

Our system development took place on two 
levels. Most of the standard modules (aside from the 
basic tools such as color maps, slicers, and viewers, 
etc.) are either too general or too limited for 
developing a specific application, although they are 
very useful at the prototyping stage. The first part of 
the development was to design and implement new 
modules that were needed to visualize and analyze 
our simulation results in a magnetospheric context. 

In the next section, we will give a few examples of 
modules that were designed for analyzing three- 
dimensional global MHD simulation results. The 
second part of the system development was focused 
on improving the user interface. Although the AVS 
interface is intuitive and easy to learn, it can be 
tedious and frustrating to assemble complicated 
networks or just to remember the appropriate 
sequential combinations of operations required to 
perform certain tasks. Most of the time, predefined 
menus are easier to work with as they provide 
straightforward access to the most frequently used 
analysis routines. This approach also allows users 
who know only minimal AVS syntax to perform 
some analyses quickly. Nevertheless, we designed 
the interface such that primary networks remain 
accessible to users who are more familiar with AVS 
in order to let them incorporate other elements into 
their analysis. 
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3.2 Global MHD Simulations 

The results that we use to illustrate our visual- 
ization system were obtained from three-dimen- 
sional MHD simulations of the solar wind interac- 
tion with Earth’s magnetosphere. These global 
simulations were carried out specifically to show 
how they could help locate the four spacecraft of 
the CLUSTER mission (launch planned for end of 
1995) with respect to the various magnetospheric 
boundaries [Berchemetal., 1993]. Ourpurposeat 
this stage was not to conduct a systematic study, but 
to provide practical examples of what could be 
done for mission planning. We imposed two simple 
upstream boundary conditions (northward and 
southward) on the incoming interplanetary mag- 
netic field (IMF) and described the ionospheric 
boundary of the system by using a conductivity 
tensor obtained from an analytical model 
[Rasmussen and Schunk, 1987], We focused on a 
single period of the year (the vernal equinox) by 
specifying in the model the actual orientation of the 
geographic and magnetic dipole axes for that 
period. 

The algorithm used in this simulation solves 
one-fluid ideal MHD equations with an explicit 


conservative predictor-corrector time-stepping 
scheme and hybridized numerical fluxes for fourth 
order spatial finite differencing. The computational 
mesh is rectangular (x,y,z) but nonuniform, and 
contains 10 6 grid points. For the results shown 
here, we designed a special grid to obtain maximum 
resolution (0.4 Re) in the cusp regions. The dimen- 
sions of the simulation box are 32 Re in the 
sunward direction, 80 Re along the tail and ± 35 Re 
in each transverse direction. Simulations using 
larger system sizes (up to 400 Re tailward) have 
also been carried out for distant tail studies in 
support of the ISTP GEOTAIL mission [e.g., 
Raederetal., 1993]. Basic information on the 
initialization procedures and boundary conditions of 
the simulations reported here can be found in 
Berchemetal. [1993]. 

3.3 Visualization of Global Magnetospheric 

Configurations 

Figure 2 shows simulation results obtained after 
one hour of real time has elapsed during which a 
steady northward IMF boundary condition has been 
maintained to convect the unphysical initial state 
outside of the simulation system. Only a small part 



Figure 2. Perspective view of two-dimensional color-coded contours of the logarithm of plasma pressure obtained from three- 
dimensional global MHD simulation for northward IMF. 
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( 100 x 50 x 50 Re) of the simulation system is 
represented by a three-dimensional perspective 
view of two-dimensional color-coded contours of 
the logarithm of plasma pressure. These cuts are 
obtained by using standard AVS orthogonal slicers. 
One cut is taken along the meridian x-z (GSE) 
plane, whereas the two others are taken orthogo- 
nally to that plane at x = 30 and 75 Re- The strong 
skewing of the magnetotail results from the tilt of 
the magnetic dipole axis. 

W e traced two-dimensional field lines obtained 
from the magnetic field vectors projected onto the 
meridian contours of the plasma pressure. Al- 
though they are not strictly magnetic field lines 
because of the tilted dipole axis used here, these 
traces indicate clearly that reconnection with the 
solar wind field is occurring in the nightside high- 
latitude region, as is expected during periods of 
northward IMF. Since the tilt angle is small, the 
lines also help in visualizing the different magneto- 
spheric regions. Starting at the left with a low 
pressure solar wind region (=5pPa; deep blue), we 
observe successively the bow shock, marked by a 
narrow layer of increasing pressure (yellow), a high 
pressure magnetosheath region (=600pPa; red), 
another tran sition region of intermediate pressure 
corresponding to the magnetopause and the low- 
latitude boundary layer (orange and yellow), the 
plasma mantle (green and turquoise) where the 
pressure starts to decrease towards its minimum 
magnetospheric value in the lobe (deep blue), then 
the plasma sheet boundary layer where it rises again 
(turquoise) and finally the central plasma sheet with 
a pressure comparable to its mantle value (green). 
Note that the deep blue 3.7 Re circular region with 
its center at Earth delimits the inner boundary of the 
simulation model. 

When analyzing a snapshot of three-dimen- 
sional MHD simulation results, it is very important 
to work from a global understanding of the state in 
which the simulated magnetosphere has evolved 
before moving on to a deeper analysis. Identifying 
regions where magnetic field energy is converted 
into plasma energy or the reverse, is one of the 
fundamental parts of this process. Tracing mag- 
netic field lines is of course essential in localizing 
merging processes. Nevertheless, other diagnostics, 
such as plotting the electromagnetic quantity (E»J) 
or the ratio of the plasma pressure over the mag- 


netic field pressure (plasma beta), are also very 
instructive. We implemented a module that can 
extract and calculate these physical quantities, 
which can then be displayed by using standard AVS 
modules or fed to other modules for further ma- 
nipulation. From the five primary fields routinely 
extracted from the computational grid (plasma 
pressure, density, and velocity; magnetic field and 
current density), more than 30 quantities are de- 
rived. Some of these quantities are just vector 
components and magnitudes of primary quantities 
or straightforward plasma parameters and are 
computed interactively. Other quantities, such as 
the magnetic field topology parameter (this param- 
eter indicates whether a grid point is located in an 
open, a reconnected, an Earth-closed, or a loop- 
closed magnetic field line), are more involved and 
need to be either generated during the simulation 
run in addition to the regular grid parameters or 
post-processed. 

3.4 Visualization of Three-Dimensional Local 

Structures 

Although they require further refinement, 
global MHD simulations are now sufficiently 
sophisticated that they can reproduce small scale 
features. The substantial increase in spatial resolu- 
tion obtained during the past few years has been 
made possible not only by improving algorithms 
but also using more and more powerful computers 
such as massively parallel machines. The simula- 
tions shown here, for example, were run on the San 
Diego Supercomputer Center (SDSC) Intel iPSC/ 
860 and Paragon machines. These new capabilities 
prompted us to design modules to facilitate the 
visualization of local processes. 

One of these modules is our interactive field 
line tracer. As we mentioned before, tracing 
magnetic field lines is very important for 
understanding the global topology of the 
magnetosphere; it can be very useful for local 
studies as well. Although a certain number of field 
(or stream) line tracers is available, they are not 
flexible enough to carry out such local analyses. 

The three-dimensional topology of magnetospheric 
processes can be fairly complex. For example, the 
formation of magnetic flux ropes and closed loops 
can lead to endless field lines that are sources of 
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problems in terms both of computational speed and 
memory space. Selecting the field lines to be 
plotted presents another difficulty. In most cases, 
one attempts to isolate and follow bundles of field 
lines rather than using large scale systematic 
displays which result in intricate and unintelligible 
plots. 

Our field line tracer module uses a mouse- 
driven point-and-click technique to determine 
interactively the starting points of the individual or 
group of field lines to be traced. In addition, the 
integration step, the length of the field line and the 
number of segments to be displayed can be ad- 
justed. Field lines are automatically color coded to 
indicate whether they are open, closed, etc. This 
module is thus particularly useful for visualizing 
magnetic field lines, but it can be used as well to 
study the topology of any of the vector fields 
calculated or derived from the simulations. We 
have already shown (Figure 2) an example of an 
application in which we used the vector field 
obtained by projecting the magnetic field onto a 
plane. Figure 3 shows another example where we 
have used the field line tracer to obtain a three- 
dimensional rendering of the dayside magnetopause 
Chapman-Ferraro current system. 



137 

In Figure 4, we display an example of the field 
line tracer used in conjunction with an isosurface 
representation to investigate the three-di mensional 
structure of the magnetic cusps for northward IMF. 
The isosurface of the plasma pressure representing 
the dayside magnetospheric boundary is viewed 
from the sun (Panel a) and from above the northern 
cusp (Panel b). We have traced only the magnetic 
field lines originating from the northern cusp region 
to clarify the representation. Careful examination 
of these views reveals that two large funnel-like 
structures are formed in each cusp region. One of 
the funnels is located at lower latitudes and is 
populated only by inner closed field lines, which 
are shown in black. The second funnel, found at 
higher latitudes, is composed of a mixture of closed 
(blue) and open (yellow) field lines. The open field 
lines drape over the closed ones; their open ends are 
located in the southern hemisphere. These field 
lines are newly reconnected field lines which are 
convected along the magnetospheric flanks. Analy- 
sis of flow patterns and the ionospheric electrostatic 
potential (not shown here) suggests that this type of 
structure in the plasma pressure appears because the 
dipole tilt destroys the symmetry of the convection 
patterns [Berchemetal., 1993]. 



Figure 3. Rendering of the Chapman Ferraro current system. Panel (a) shows the contours of plasma pressure in the x-z meridian 
plane that were used to select points (shown by the green spheres) located at the magnetopause, whereas Panel (b) shows traces 
of the current lines passing through these points. 
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Figure 4. The isosurface of the plasma pressure representing the day side magnetospheric boundary is viewed from the sun (Panel a ) 
and from above the northern cusp (Panel b). We have traced only the magnetic field lines originatingfrom the northern cusp region 
to clarify the representation. 
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3.5 Data Closure Through Visualization 

As we have seen above, the visualization of 
both global and local magnetospheric structures is a 
very important tool for studying the evolution of the 
magnetosphere. However, except for being intel- 
lectually satisfying, a qualitative comparison 
between these structures is not sufficient to realize 
the data closure step necessary to improve the 
model. Comparing actual satellite measurements 
with simulated data streams from virtual spacecraft 
in fact may provide the most direct way of deter- 
mining whether the phenomena represented by the 
simulation are the best elements to use for interpret- 
ing the spacecraft observations. To add this impor- 
tant capability to our system, we designed a special 
module that displays real or fictitious spacecraft 
trajectories and plots the simulation results along 
them. Input to this module can be in the form of an 
orbit file or free-hand selection using a mouse- 
driven point-and-click technique. In addition, we 
can compute interactively the values from the data- 
constrained Tsyganenko magnetic field models 
along the orbit and superimpose them onto the 
plots. This module uses the NCAR graphics 
package because it offers more versatility than AVS 
does in displaying two-dimensional plots. In 
addition to being displayed in a screen window, the 
data stream can be written to PostScript and ASCII 
files and then printed or further analyzed by tradi- 
tional time series analysis programs. 

Figures 5 and 6 show an example of the output 
obtained for one of the inclined (8 x 22 Re) CLUS- 
TER trajectories that has been considered in mis- 
sion planning. In Figure 5 we have superimposed 
the spacecraft trajectory over the contours of the 
logarithm of the plasma pressure in the orbital 
plane. Only the front part (25 x 25 x 25 Re) of the 
simulation system is displayed here, viewed from 
the duskside at about 20:00 LT. The spacecraft 
trajectory is shown by small red spheres linked by a 
white line. The green and yellow spheres indicate 
the start and end of the trajectory respectively. 
Simulated data streams are shown in Figure 6 for 
both southward (left) and northward (right) IMF. 
From the top to the bottom, we plotted the three 
components (GSE) and magnitudes of the magnetic 
field (nT), plasma flow velocity (km/s), density 
(cnr 3 ), pressure (pPa), and plasma beta. To assist 



Figure 5. Spacecraft trajectory superimposed over the contours 
of plasma pressure in the orbital plane. 


CLUSTER 8x22 96-03-19-7:30 to 96-03-22-14:00 
IMF Bz < 0 IMF Bz > 0 



Figure 6. Simulated data streams for both southward ( left) and 
northward (right) IMF. 

in making comparisons, we superimposed (dashed 
lines) the magnetic field obtained from the 
T syganenko [1989] model for a time at the center of 
the time period used to calculate the spacecraft 
trajectory and option 2 (K p = 1-, 1,1+). Note that 
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the values from the magnetic field model 
[Tsyganenko, 1989] are not valid beyond the 
magneto-pause. Locations of the spacecraft are 
given at the bottom of the figure; R is the geocen- 
tric distance (Re), LT the local time and LAT the 
latitude (degrees); all are calculated in GSM 
coordinates. 

Although the resolution of the plots shown here 
is relatively low because we had to accommodate 
the large variations observed along the entire 
trajectory, we can easily follow the virtual space- 
craft through the different magnetospheric regions 
and identify their boundaries. Special ports to the 
module have been designed to superimpose space- 
craft data when they become available. Since the 
period of steady solar wind modeled here is prob- 
ably too long, we did not attempt to make any 
specific comparisons with existing data. However, 
we anticipate that this system will be even more 
valuable when we have observational data for 
which solar wind conditions are known, as these • 
can be used as input parameters for the global 
simulations. This will allow us to move to the next 
step, that of thoroughly testing the predictive 
capabilities of our models. 

4. SUMMARY 

We reviewed here examples illustrating a few 
of the capabilities of the interactive visualization 
system that we developed to analyze the results of 
mission-oriented simulations. These examples 
show that recent advances in hardware and software 
graphics technology have opened new and exciting 
possibilities. As in many other areas, visualization 
techniques have become an invaluable tool for 
space physicists. They can be used to interact with 
the results of very complicated simulation models 
in order to grasp the fundamental physics involved 
and compare their results to observations. With 
these new interactive capabilities, visualization has 
also evolved from being the end product of the 
physical analysis (the “pretty picture”) to becoming 
an integral part of the intellectual process of explor- 
ing, testing, and communicating simulation results. 
We have seen through the examples shown here 
that simulation and visualization techniques have 
reached the stage where together they can provide a 


very powerful tool to interpret space plasma obser- 
vations and realize the synergistic theory-data 
closure necessary to improve further our theoretical 
models. We anticipate that this tool will also be 
very useful for planning future missions, as well as 
in defining the best strategy for coordinated theory- 
data campaigns. 
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Using the Application 
Visualization System To View 
Haloe Three-Dimensional 
Satellite Data 
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KENNETH A. STONE 4 , RALPH J. CICERONE 1 

The Application Visualization System (AVS) is used to 
view a three-dimensional data field containing the vol- 
ume mixing ratios of a chemical species in the middle 
atmosphere obtained by the Halogen Occultation Ex- 
periment (HALOE) aboard the Upper Atmosphere Re- 
search Satellite (UARS). Since launch in September 
1991, HALOE has been collecting data on approxi- 
mately 30 sunrise/sunset events in two narrow latitude 
bands each day. The vertical volume mixing ratio 
profiles are retrieved for eight species for each event. 
The accumulated data for approximately 30 days cover 
most of the globe (limited by sunlit latitudes), and this 
monthly data block can be described as the volume 
mixing ratio of a specific species in the atmosphere as a 
function of latitude, longitude, and height. Thedatawere 
remapped using linear interpolation for pressure levels 
and Gaussian weighted binning from sampling locations 
to a three-dimensional grid. An AVS network is con- 
structed that allows for viewing the three-dimensional 
field with rendered slices at constant latitudes, longi- 
tudes or pressure levels. Discussions are given on the 
advantages and some disadvantages learned about from 
experiences applying AVS to visualize HALOE three- 
dimensional data. 


1 Department of Earth System Science, University of 
California, Irvine, CA 

^Office of Academic Computing, University of California, 
Irvine, CA 

3 Atmospheric Sciences Division, NASA Langley Research 
Center, Hampton, VA 

4 G & A Technical Software, Inc., Hampton, VA 


1. INTRODUCTION 

As many geophysical scientists are beginning to 
realize, the analysis of the fast growing atmospheric 
observation data set requires the support of com- 
puter software tools that allow them to view, 
manipulate, and display the data in a visual form 
before presenting a verbal description. The advan- 
tages of applying satellite-borne instruments to 
remotely sound atmospheric properties are that the 
measurements can cover a large latitude/height 
range and the result can be time dependent. These 
three-dimensional, time-dependent data sets have 
never been obtained before. It is important for 
scientists to attain access efficiently. Several 
software packages have been developed in recent 
years for scientific visualizations, and they are 
improving dramatically to satisfy a user’ s demands. 
In this paper, we share our experiences using one 
visualization package to study a three-dimensional 
data set. 

The Halogen Occultation Experiment 
(HALOE), one of the nine instruments aboard the 
Upper Atmosphere Research Satellite (UARS) 
launched in September 1991, has been making 
measurements of a total of eight stratospheric/ 
mesospheric chemical species at every spacecraft 
sunrise and sunset event (Russell etal., 1993a). We 
describe the HALOE observations in more detail in 
the next section. A HALOE monthly data block for 
a species can be described as its volume mixing 
ratio as a function of latitude, longitude, and 
pressure. Although HALOE does not provide daily 
global maps for its measured species because of the 
geometrical constraints described below, in most 
cases the monthly data block may represent the 
characteristic of a species for the corresponding 
season. 
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We chose the Application Visualization System 
(AVS) to view HALOE monthly three-dimensional 
data blocks. The main visualization strategy of 
AVS requires users to select the “building blocks” 
(modules) and construct a “building” (AVS net- 
work) to show the data in a way that is scientifically 
meaningful. The AVS provides many modules 
while also allowing users to write their own. This 
paper describes the ways we used AVS to view the 
three-dimensional data set. The last section dis- 
cusses the advantages and disadvantages of this 
approach based on our experiences. 

2. HALOE OBSERVATIONS AND LEVEL 3 

DATA PROCESSING 

By the 1993 spring American Geophysical 
Union (AGU) meeting, the HALOE on the UARS 
had already been successfully making measure- 
ments on atmospheric temperature and chemical 
species for more than 20 months. This experiment 
uses the sun as its radiation source and measures 
atmospheric absorptions in selected infrared spec- 
tral regions between 2.4 and 10 pm. It provides 


vertical mixing ratio profiles for O 3 , CH 4 , H 2 O, 
NO, NO 2 , HC 1 , HF, and aerosol extinction at every 
spacecraft sunrise and sunset event. More detailed 
information on HALOE observation principles, 
instrument construction, and data inversion tech- 
niques are in a paper by Russell et al. (1993a). 
Because UARS has a 57° inclination near-circular 
orbit and it takes 98 minutes to circle around the 
Earth at 585 km, HALOE obtains about 15 sunrise 
and 15 sunset measurements approximately every 
24 hours. The approximately 30 sunrise/sunset 
tangent point (defined as the point of closest 
approach of a solar ray path to the Earth surface 
along the limb) locations for a given day are 
distributed in two narrow latitude bands. 

Because of the complicated interplay of space- 
craft path and the orientation of the Earth’ s limb, 
the time-varying sampling points are distributed in 
a complex pattern. Figure 1 shows HALOE daily 
averaged sunrise and sunset latitude positions for 
the 30-km tangent point altitude (TPA) as a func- 
tion of time between September 8 and November 7, 
1992. We select this period when the Antarctic 
ozone in the lower stratosphere is chemically 


HALOE DAILY AVERAGED 30KM TPA LATITUDE COVERAGE 



UARS DAY IN 1992 (Starts on Sept. 12, 1991 - Day 1) 
©Sunrise 
+ Sunset 


Figure I. HALOE daily averaged 30-km tangent point latitude coverage for the period between September 8 ( UARS day 363) and 

November 7 , 1992 (UARS day 423). 
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removed from the existence of manmade chlorine 
compounds because it is of great interest to atmo- 
spheric scientists, as well as to the general public. 
During this period, HALOE observations cover 
from its northernmost latitude (about 75°N) to 
southernmost (about 78°S) in 20 to 30 days. As an 
example, data between September 8 and October 4 
cover most of the globe, which is quite useful in 
studying seasonal behavior of the chemical species. 
Likewise, Figure 2 is the HALOE sunrise 30-km 
TPA positions between September 8 and October 4, 
plotted on the orthographic projection of the Earth’s 
southern hemisphere. Along every latitude band for 
a day, 15 sunrise measurements are uniformly 
distributed in longitude. 


90 (90E) 



Figure 2. HALOE 30-km tangent point coverage between 
September 8 and October 4, 1992. Sunrise measurements are 
available for this period in the southern hemisphere. 


The HALOE data base consists of several levels 
of data: level 1, the raw data; level 2, retrieved 
temperature and the volume mixing ratios of seven 
chemical species and aerosol extinction as a func- 
tion of pressure, latitude, and longitude at measure- 
ment locations; and level 3, gridded data that have 
been remapped into a three-dimensional grid of 
latitude, longitude, and pressure level. The data 
processing from level 2 to level 3 can be described 
as follows. First, data are interpolated onto UARS 
standard pressure levels (mb), defined as 


P= 1000 x lO-b/6), 1 = 0, 1, 2, ... 35. (1) 

The pressure spacing is Alog(P) = - 1/6, equivalent 
to about 2.5 km. This is about the same as HALOE 
level 2 data vertical spacing for its gas filter chan- 
nels (CH 4 , NO, HC1, and HF). The HALOE 
radiometer channel’s vertical spacing is about 0.3 
km (O3, H2O, and NO2). 

Second, on a constant pressure surface, data are 
averaged using a Gaussian-weighted binning 
method. The gridded data spacing was chosen to be 
1 0 latitude by 2° longitude. The normalized 
Gaussian function applied to the data point inside a 
bin is 

G = exp { - [ (x/g x ) 2 + (x/o y ) 2 ] / 2 } , (2) 

where x and y represent the differences of longitude 
and latitude between the data grid and the sampling 
position. The standard deviations are selected to be 
1 5.0° longitude (o x ) and 5.0° latitude (a y ). The bin 
width used is two times the standard deviations in 
longitude and latitude, respectively. The volume 
mixing ratio at a grid point (Z ij) is therefore calcu- 
lated by 

^ _ L [G x Z] (within the bin) 

V Z G (within the bin) 

where Z represents a sampling data point within the 
bin. 

HALOE level 3 data provide three-dimensional 
gridded data blocks for any selected period. The 
use of a state-of-the-art scientific visualization 
package helps us view, manipulate, and analyze the 
data. The specific values used for parameters in the 
binning method can be adjusted. Several tests were 
performed to study the effect of these parameters on 
our analysis (for example, to change the values of 
a x and G y and bin width). Data visualization tools 
will no doubt add power to the tests. We will not 
discuss this analysis in this paper because it is not 
one of the main goals of the paper. 

3. ABOUT AVS AND ITS APPLICATION AT 

THE UNIVERSITY OF CALIFORNIA, 
IRVINE 

Advanced Visual Systems, Inc., created AVS as 
a software package for scientific data visualization. 
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Stardent Computer Corp. originally designed it for 
software developers designing complex software 
packages with graphical interfaces. The subsequent 
use of AVS has shown that it is quite useful as a 
program for scientific visualization and 
interpretation of data by the much larger scientific 
user community. At present, AVS is available on 
most UNIX-based computer workstations (HP, 

IBM, SGI, DEC, Sun, Kubota, and Data General), 
as well as the larger supercomputers (Convex, Cray, 
Thinking Machines, etc.). 

AVS was originally introduced to the University 
of California, Irvine, through the introduction of the 
then-powerful Stardent Titan line of workstations. 

As newer versions of AVS became available on 
other computer platforms, the use of AVS at the 
university has greatly increased. Many research 
groups across the spectrum of research interests 
(medical imaging, neuroscience, combustion 
engineering, fluid mechanics, groundwater pollution, 
astrophysics, etc.), as well as geophysics, have been 
making substantial progress understanding their data 
and simulations using AVS. Several graphics 
laboratories have been set up using AVS on DEC, 
Stardent, and Convex computers. 
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The usefulness of AV S for scientific data 
visualization surfaced from the visual and data flow 
interfaces of AVS. The visual interface allows the 
system user to design the kind of visualization tool 
needed for his or her particular research through a 
series of standard point-and-click operations, which 
combined with the data flow structure of AVS, 
guide a user into building a visualization “network” 
from a set of simpler “module” building blocks. 
Using AVS for the first time therefore entails 
learning the visual interface functions, searching 
the module library for the functions that are needed, 
and importing one’s data into an AVS format that is 
based on a mathematical vector field (one or more 
data values at points defined in a one- to three- 
dimensional geometry). 

To most geophysical scientists, learning new 
computer software means investing a large amount 
of time. The compensation for the use of AVS is its 
general application across a wide variety of data. 
Generality allows AVS to take advantage of many 
types of modern visualization techniques 
(renderings) as well as mathematical operations 
(such as image processing). Most importantly, AVS 
allows the creation of new modules based on the 



Figure 3. An example of the AVS network (connection of AVS modules) used to view HALOE three-dimensional data. 
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high-level computing languages C and FORTRAN, 
known to most scientific users. 

4. THE USE OF AVS TO VIEW THREE- 

DIMENSIONAL HALOE DATA 

As described in section 2, the HALOE data 
block between September 8 and October 4 covers 
latitudes from 72S to 72N with 1° x 2° as latitude 
and longitude spacings, and we consider pressure 
levels between 100 (—16 km) and 1 (—48 km) mb, 
with A/og(P) = -1/6 as spacing. Therefore, the total 
number of data points (the volume mixing ratio of 
ozone, for example) in this data block is 1 8 1 x 145 
x 13 (longitude x latitude x pressure). 

Figure 3 shows a simple AVS network that 
connects basic software components (modules) 
needed to examine our three-dimensional data 
block. This network looks like a flow chart — data 
are read in first and then are displayed. The key 
module used in the network is the “orthogonal 
slicer,” which searches for data at selected constant 
longitude, latitude or pressure levels. The “gener- 
ate colormap” module is used to define the color 
scale according to the data value range. One can 
turn on the “animated integer” module is used to 
define the color scale according to the data value 
range. The “animated integer” module to dynami- 
cally display the three-dimensional data, for in- 
stance, moving the orthogonal slicer at constant 
pressure surface through the entire range of pres- 
sure levels with the desired number of steps. The 
explanation of other modules in Figure 3 can be 
found in the AVS user’s guide, the AVS on-screen 
help menu, or the following examples. 

HALOE level 3 CH 4 data are examined using 
the AVS multi-orthogonal slice module, as shown 
in Figure 4. Figure 4(a) illustrates three ways to 
examine the data block, placing a rendered slice at 
any desired constant latitude, longitude, or pressure 
level. Figure 4(b) is more interesting scientifically 
because it shows two rendered slices of CH 4 mixing 
ratios at southern and northern high latitudes, 
respectively. CH 4 is often treated as a dynamical 
tracer for transport processes in the Earth’ s atmo- 
sphere (more specifically, stratosphere extending 
from 100 mb to 1 mb) because it is not photochemi- 
cally produced in the atmosphere and its strato- 
spheric lifetime is about the same order of magni- 


tude as the time constants for transport by the 
meridional winds. At northern high latitudes, CH4 
is uniformly distributed along a latitude circle. In 
contrast, CH 4 shows strong longitudinal asymmetry 
at southern high latitudes in austral early spring, the 
period that we selected. It is important for atmo- 
spheric scientists to monitor the behaviors of 
“tracer gases” such as CH 4 during the course of 
Antarctic winter and spring. The evidence of low 
CH 4 within the polar vortex in the lower strato- 
sphere, where ozone chemical destruction occurs, 
indicates that the air inside the vortex experienced 
significant descent prior to the observation . 

To properly orient the data to geographical 
features, we created an AVS module (using FOR- 
TRAN) to convert the input data to an AVS field 
where each data value was mapped to the corre- 
sponding x,y,z position based on its latitude, 
longitude, and pressure level [the radial spacing of 
the data value is set by an adjustable scale factor 
multiplying the pressure index (1 in equation 1 )] . 
While the resulting pressure level surface is merely 
a relative measure of the real pressure level, the 
latitude/longitude information is directly compa- 
rable to the continental geometry. Among the many 
types of possible renderings of this remapped data, 
we mainly used constant pressure surfaces (see 
Figure 5) because their comparison with land 
masses is obvious. 

A useful AVS function is general object ma- 
nipulations. In our case, the global ozone data at a 
constant pressure level can be manipulated simply 
by clicking and moving the computer mouse. 

Figure 5 shows two snapshots of our “object” while 
we moved it around. With a little practice, users 
can easily learn three ways of manipulating the 
object-translation, rotation, and scaling. Through 
rotating our ozone surface, we could examine 
global ozone data and make comparisons between 
different latitude regions. 

HALOE data show that the ozone-depleted area 
is strongly correlated with southern polar vortex 
location. Here, the “tracers” measured by HALOE 
show air descending, and this dynamically isolated 
and chemically disturbed area is not zonally sym- 
metric over the pole in late September and early 
October 1992. The “ozone hole” has a large 
geographic coverage extending to the southern tip 
of South America. In the Northern Hemisphere, 
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HALOE CHi MIXING RATIO SEPT. 8 - OCT. 4. 1992 



1 0 (48km) 

Pressure (mb) 

ICC (16km) 

72N 


Longitude (deg) 


72S 


-ISO 


(b 
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Latitude ( deg ) 


1.0 (48km) 
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I00(l6kiu) 
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no data 0.1 0.3 0.5 0.7 0.9 11 1-3 1.5 >1 5 
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Figure 4. HALOE-measured CH 4 volume mixing ratios in the period between September 8 and October 4, 1992 — (a) an example 
of using rendered slices to view the data block at arbitrary constant latitude, longitude, or pressure level; (b) two slices of CH 4 
at 71 °S and 71 °N, respectively, showing the different characteristics of CH 4 along the two latitude circles. 
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Figure 5. HALOE-measured ozone volume mixing ratios at 32 MB in the time period between September 8 and October 4, 1992. 
A VS allows users to manipulate the "object" (rotation and translation, etc.) by clicking and moving the computer mouse. Two 
snap shots of global ozone data are shown here. In the top image, the outline of South America is just above center. The bottom 
image looks down on North America. 
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however, HALOE ozone data show zonal symmetry 
when summer turns to winter. More detailed 
scientific analysis of HALOE measurements in 
polar regions appears in other publications (such as, 

Russell et al., 1993b; Tucket al., 1993). 

5. DISCUSSION 

With some imagination, AVS provides many 
useful tools to visualize three-dimensional data. 
Before the 1993 AGU spring meeting (May 26 — 28) 
when this book was initiated, we used AVS version 
4.1 on a DEC 5000/200 workstation with 32 MB 
memory. Although AVS is also available on the 
Convex machine at the University of California, 
Irvine, we found we had to share that resource with 
many other users and the processing time was quite 
slow. AVS 5.0 was released before the AGU 
meeting, and it recently became available. This new 
version of AVS has several major improvements. 

For instance, a color legend module is added to the 
standard AVS module library. This module is useful 
for studying data quantitatively. 

A V S is also useful for viewing and manipulat- 
ing large three-dimensional satellite data sets. The 
outputs from mass production of satellite data are 
data files containing valuable information for 
scientific studies. For example, since launch in 
September 1991, UARS, with nine scientific 
instruments onboard, has been making atmospheric 
and space environment measurements over two 
Antarctic winter/spring (1991 and 1992) and two 
Arctic winter/spring (1992 and 1993) periods, and it 
has already started its 1993 Antarctic winter/spring 
measurements. The concentrations of many atmo- 
spheric species have never been observed prior to 
UARS. An efficient way to view those data is 
badly needed, which not only allows scientists to 
scan over a large amount of data in a relatively 
short period, but also facilitates a quick publication 
of the analysis. As we showed in previous sections, 
the two AV S networks we constructed, combined 
with on-screen object manipulating functions of 
AVS, allow us to examine the data efficiently. 

With a little help, atmospheric scientists can learn 
a few basic functions (software components) that are 
commonly used in their field. A V S provides hundreds 
of standard modules, but only a few are useful for a 
specific case in a scientific sub-field to display the data 


in a meaningful way. AV S also provides user-friendly 
control panels to allow users to change adjustable 
parameters in a module, as well as viewer control 
panels to allow users to manipulate objects and to 
optimize the viewing effect. Learning a new visualiza- 
tion package and getting familiar with it are always 
time consuming and work intensive, but it is quite 
enjoyable to work on AVS with the HALOE data. 

Although A V S allows users to write their own 
modules, it is not usually the preferred choice. 
While AV S modules are actually C functions or 
FORTRAN subroutines, they contain enough AVS- 
based function calls to make the subroutine creation 
process intimidating enough for the casual pro- 
grammer. However, when needed, the ability to 
build a module can be easily expanded to create 
modules that can manipulate the scientific data, 
imagery of that data (without resorting to X win- 
dow programming), or even three-dimensional 
geometries from that data (without any knowledge 
of the underlying graphics language on that plat- 
form). A substantial number of such user-generated 
modules are stored at the International AVS Center 
at the North Carolina Supercomputer Center. Such 
modules can be obtained from the center via 
anonymous ftp to avs.ncsc.org. 

There were a few disadvantages. Anew 
software package will always have initial defects, 
which are eventually overcome in later releases. 
There are some problems to be considered before 
investing time and effort. 

The major problem was the processing rate for 
our data, which was 181x145x13 (longitude x 
latitude x pressure), and was very slow using a 
DEC 5000/200 workstation (23 spec mark) with 32- 
MB memory. The graphic board that the DEC 
station uses is a PXG board, which has a speed of 
approximately 55,000 shaded polygons/second. 

The rotation of our “object” (see Figure 5) is slow 
but tolerable. The data rendering, for instance, 
changing from one constant pressure surface to 
another, is quite slow (typically a couple of min- 
utes). We assume that the 32-MB memory is not 
adequate (swapping between the CPU and its hard 
disk is usually the problem). A faster CPU speed 
and larger memory (larger than 32 MB) is recom- 
mended for similar size data sets. 

A V S is a very powerful tool for viewing three- 
dimensional data, but from our experiences, it is not 
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a good package for two-dimensional or one- 
dimensional plots (current version). The labeling 
ability of AVS is relatively poor, which not only 
affects the quality and readability of two-dimen- 
sional plots but also causes difficulty in quantitative 
study using three-dimensional display. Another 
constraint is that users have to use AVS line com- 
mands to group their selected objects and manipu- 
late the object group (rotation/translation). Al- 
though writing one’ s own module could solve many 
or all of the above problems, many individual users 
will lose interest in using this package for some of 
their studies. Later AV S versions could accommo- 
date more user demands with standard functions. 
The linkage of AVS with other mathematical and 
visualization tools (e.g., IMSL and IDL) obviously 
adds power to this multi-software package, al- 
though we have not used these combinations on our 
data set. 
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To better predict global climate change, scientists are 
developing climate models that require interdisciplinary 
and collaborative efforts in their building. We are 
currently involved in several such projects but will 
briefly discuss activities in support of two such comple- 
mentary projects: the Atmospheric Radiation Measure- 
ment (ARM) program of the Department of Energy and 
Sequoia 2000, a joint venture of the University of Cali- 
fornia, the private sector, and government agencies. 
Our contribution to the ARM program is to investigate 
the role of clouds on the top of the atmosphere and on 
surface radiance fields through the data analysis of 
surface and satellite observations and complex model- 
ing of the interaction of radiation with clouds. One of our 
first ARM research activities involves the computation of 
the broadband shortwave surface irradiancefrom satel- 
lite observations. Geostationary satellite images cen- 
tered over the first ARM observation site are received 
hourly over the Internet network and processed in real 
time to compute hourly and daily composite shortwave 
irradiance fields. The images and the results are trans- 
ferred via a high-speed network to the Sequoia 2000 
storage facility in Berkeley, where they are archived. 
These satellite-derived results are compared with the 
surface observations to evaluate the accuracy of the 
satellite estimate and the spatial representation of the 
surface observations. In developing the software in- 
volved in calculating the surface shortwave irradiance, 
we have produced an environment whereby we can 
easily modify and monitor the data processing as re- 
quired. Through the principles of modular program- 
ming, we have developed software that is easily modified 
as new algorithms for computation are developed or 
input data availability changes. In addition, the software 
was designed so that it could be run from an interactive, 
icon-driven, graphical interface, TCL-TK, developed by 
Sequoia 2000 participants. In this way, the dataflow can 
be interactively assessed and altered as needed. In this 
environment, the intermediate data processing "im- 
ages” can be viewed, enabling the investigator to easily 
monitor the various data processing steps as they 
progress. Additionally, this environment allows the 
rapid testing of new processing modules and allows their 
effects to be visually compared with previous results. 


1. INTRODUCTION 

To aid in the prediction of global climate 
change, scientists are developing climate models 
that require interdisciplinary and collaborative 
efforts in global data acquisition, complex model 
building, and innovative result analysis and inter- 
pretation. Scientific teams, distributed worldwide, 
analyze and produce fields of geophysical param- 
eters used as input to climate models. These 
parameters include sea surface temperature, surface 
albedo, carbon flux, wind fields, atmospheric 
temperature, and moisture profiles, among others. 
To accomplish this enormous task, data need to be 
easily transferred between sites via high-speed 
computer networks at regular intervals, processed at 
various sites, and finally transferred to data base 
storage facilities for other scientists to utilize. We 
have developed a system for producing global fields 
from geostationary satellite data and visually 
assessing the results as a quality control measure. 
This system computes surface solar radiation flux, 
but the approach can be generalized and applied to 
any computer model, especially satellite data 
processing systems. 

2. DATA SETS AND INFRASTRUCTURE 

We have integrated the capabilities of two 
distributed projects in which we are involved to 
produce the computation and display system 
described herein: the ARM (DOE, 1990) program 
of the Department of Energy and Sequoia 2000 
(Dozier, 1993), a joint venture of the University of 
California, the private sector, and government 
agencies. These are both long-term projects; we 
have developed the computer code of our radiative 
transfer modeling system with this in mind. 
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ARM is a multi-investigator research endeavor 
aimed at studying the role of clouds in climate 
through the improvement of their parameterization 
in numerical models. A 10-year field program is 
being developed to collect a comprehensive data set 
to characterize cloud properties and their interaction 
with the environment. The data set is collected 
from a number of high data accumulation rate 
instruments, such as surface interferometers, 
satellite imagers, and atmospheric sounders. 

Our contribution to the ARM program is to 
investigate the role of clouds on the top of the 
atmosphere and on surface radiation fluxes through 
the data analysis of surface and satellite 
observations and the complex modeling of the 
interactions between solar radiation and clouds. 

One of our first ARM research activities involves 
the computation of a shortwave (0.25 - 4.0 pm) 
solar radiation flux at the surface from satellite 
observations, and we use this task as the example 
for our visualization system. Geostationary satellite 
images from GOES, centered over the first ARM 
Southern Great Plains (ARM-SGP) observation site 
(located in Oklahoma), are received hourly over the 
Internet network and processed in real time to 
compute hourly and daily surface shortwave 
irradiance. Every hour, images are received and 
rectified to ground coordinates. These images and 
the results of our radiative transfer model (RTM) 
are transferred via a high-speed network to the 
Sequoia 2000 storage facility in Berkeley, where 
they are archived and made available to other 
scientists. In parallel, we receive in-situ surface 
shortwave flux measurements from the ARM-SGP 
site py ranometer for model validation. 

Sequoia 2000 is an interdisciplinary program 
bringing together global change and computer 
scientists to address issues of rapid data transfer, 
data management, and advanced visualization 
techniques in support of global change research. 
Global change science, in turn, serves as a test bed 
for the advanced distributed information systems 
developed by the computer scientists. The system 
provides the infrastructure to enable scientific 
interactions among researchers of different disci- 
plines and among researchers employing different 
methodologies. The program has developed an 
information system, not just a data system. For 
example, the Sequoia 2000 data base provides 


geophysical and biological information, not just raw 
data from space-borne instruments or in-situ 
sensors. 

3. RADIATIVE TRANSFER MODEL AND 
ALGORITHM 


At the heart of the visualization system is the 
RTM, which produces the data layers for the 
graphical user interface (GUI). The simple model 
used was first introduced by Gautier et al., (1980), 
with later improvements from Diak and Gautier 
( 1 982). The following is only a general description 
of the model because the purpose of this paper is to 
describe our data handling, algorithm design, and 
visualization techniques. A flowchart for the current 
processing algorithm is displayed in Figure 1 . 
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Figure 1. Flowchart of the radiative transfer model. 


Daily average downwelling surface solar 
radiation fluxes are produced from visible geosta- 
tionary satellite images and are recorded hourly 
for our experiment. As the first step, the process- 
ing system ingests the visible satellite data and 
calculates both solar and satellite geometries 
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(zenith and azimuth angles). Upwelling radiance 
values are then computed using satellite calibration 
coefficients obtained from Whitlock (personal 
communication, 1994). A surface albedo is esti- 
mated by computing the minimum brightness 
values from a set of images taken at the same 
hour each day over the previous 15 days. This 
“minimum brightness” compositing technique is 
quite common in remote sensing. The minimum 
brightness value is assumed to correspond to a 
“clear sky” value for each pixel. The directional 
surface reflectance is calculated from this 
value and is used to estimate the surface 
broadband albedo using the spectral and direc- 
tional transformation characteristics of the surface 
type over which the solar flux is estimated. 

Two atmospheric parameters are calculated 
next: ozone transmittance and Rayleigh scattering. 
Subsequently, acloud detection threshold value is 
computed, based on the clear sky pixel values, to 
distinguish between clear and cloudy pixels. A 
computation for downwelling surface solar flux 
is made for either the clear or cloudy conditions, 
based on the cloud detection results. If the condi- 
tion is determined to be clear, a clear sky radiative 
transfer model is applied, which computes the 
attenuation of the solar flux by water vapor and 
ozone absorption and by Rayleigh and aerosol 
scattering using climatological concentrations of the 
different absorbers and scatterers. If the condition 
is determined to be cloudy, cloud albedo directional 
reflectance is computed and used as the cloud 
broadband albedo. The surface solar flux is calcu- 
lated using the cloud albedo to estimate cloud 
transmission and absorption and then applying this 
to the computed downwelling clear flux. After the 
hourly observations are processed for each pixel 
composing the image, the hourly estimates are 
integrated over time to obtain daily averaged 
estimates. 

The satellite-derived results are compared with 
surface observations to evaluate the accuracy of the 
satellite-based estimates. Comparisons are per- 
formed on a 3-km-by-3-km spatially averaged 
model estimate to compensate for locational inaccu- 
racies of the satellite observations and spatial 
integration (2 solid angle) of the surface observa- 
tions. To further adjust for these effects, the 
surface observations are time averaged over 20- 
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Figure 2. In-situ shortwave measurements versus radiative 
transfer model estimations. 


minute intervals encompassing the satellite obser- 
vation time — the average time for clouds to move 
across the 3-km field of view. Averages from 
approximately 800 observations are compared with 
the RTM estimates and are displayed in Figure 2. 
The model producesvery good results, with a mean 
difference of about 45 Wm* 2 and a standard devia- 
tion of the mean difference of 53 Wm 2 , or about 
10 percent of the mean value. The mean difference 
indicates that there is a small bias in the satellite 
estimates, which might result from either the 
satellite sensor calibration, the surface pyranometer 
calibration, or the model assumptions. With high- 
quality surface instrument calibration, these 
comparisons could be used to improve the satellite 
sensorcalibration. 

4. SOFTWARE SYSTEM 

As mentioned previously, this RTM is a long- 
term project and in the future will be continually 
refined, modified, and improved. To facilitate this, 
we have developed, through the principles of 
modular programming, a software system that is 
easily modifiable as new algorithms for compu- 
tation are developed or as new observational 
inputs become available. The algorithm functions 
have been developed to be coherent, self-contained 
modules that perform calculations for individual 
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radiative transfer properties, such as Rayleigh 
scattering or surface reflectance. To maintain a 
simple, logical program flow, all principal algo- 
rithm functions are called from the main program 
and all “accounting” procedures, such as time 
tracking, are executed in the main program. This 
leaves the modules free of extraneous activities and 
allows them to be easily modified or replaced with 
new modules in the future. Furthermore, all the 
parameters are passed to each function through the 
argument list, thereby preventing the use of global 
variables that unnecessarily interconnect the 
modules. As model development progresses, the 
modules can be “plugged” in and out without the 
burden of algorithm accounting details affecting the 
modules’ performance. As an example, the cloud 
albedo routine does not require the current process- 
ing observation time to calculate solar geometries, 
but instead is passed the geometric quantities 
through its argument list and requires only the 
number of observation pixels to process. 

All the above-mentioned routines produce a 
“data layer,” or intermediate data set, that is used 
by other routines — that is, each function produces a 
parameter for the entire hourly image and passes 
that computation to other functions. This “data 
layer” approach allows for a clean style of 
programming, but is memory intensive because 
each layer contains the entire field that is being 
processed in the run. Efforts are currently under 
way to enable the program to process smaller 
segments of the images with a user-definable 
segment size. Again, all these accounting 
procedures will be independent of the modules and 
executed in the main program. 

Written in the C programming language, the 
RTM algorithm has been developed with command 
line options, and the getopt command is used to 
parse the options on the command line. We have 
developed the RTM algorithm so that all the 
intermediate data sets can be output for 
visualization with a command line “flag” selection. 
If the option flag for a data layer is chosen, such as 
“-a” for surface albedo, the layer is written to the 
computer disk and can subsequently be displayed or 
analyzed by other routines. For example, the user 
can set such options as day of year to process and 
select which type of output to generate with the 
command line statement 


RTM -Y 93 -D 36 -s -a 

setting the day to process to year 1993 (-7 93), day 
of year to 36 (-D 36) and requesting the outputs, 
shortwave (-s), and surface albedo (-a). This design 
feature allows for a convenient coupling with the 
display routines. 

4.1 Display Routines 

Our data visualization is performed, in part, 
with Interactive Data Language ( IDL ), a graphical 
display software [Research Systems, Inc., 1991]. It 
is a commercially available interactive software 
system and programming language designed for 
plotting and visualizing images of scientific data. 
The language contains simple syntax routines for 
displaying data that can be combined to produce 
highly customized display programs for the visual- 
ization of the RTM products. IDL provides mul- 
tiple plotting capabilities so that several images can 
be displayed simultaneously in the same window as 
in Figure 3. This display configuration allows a 
side-by-side comparison of the parameters that are 
produced in the processing of the model. In this 
example, six data layers are displayed in the panel. 
In Figure 3(a), the model’s input, GOES-7 bright- 
ness field is displayed. Also displayed with each 
data layer is a self-scaling legend and the average 
value of that parameter as computed by the IDL 
display routine. Figures 3(b-e) display some of the 
intermediate processing steps generated by the 
RTM — namely, the solar zenith angle, cloud 
detection results, and the estimated cloud and 
surface albedos, respectively. The resulting 
downwelling shortwave flux to the surface is 
displayed in Figure 3(f). With this graphical 
configuration, we can visually inspect the step-by- 
step progress of the model. By examining the value 
range in the legend, or the statistics, we can critique 
any changes in the results as compared to previous 
runs. These composite visualization images can 
then be read in as frames for an animation sequence 
and played back at varying speeds on the computer 
screen or stepped through on a frame-by-frame 
basis with the IDL animation tool. In this manner, 
the model’ s step-by-step progression can be exam- 
ined for each processing day. 
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Figure 3. Intermediate data sets displayed by the visualization system. 


4.2 Graphical User Interface 

TCL-TK is the GUI used in our software 
system. It was developed by Sequoia 2000 partici- 
pants (Ousterhout, 1993). TCL-TK is a shell 
language GUI builder, and the syntax is similar to 
the UNIX shell programming language. It is public 
domain software available via anonymous ftp at 
sprite.cs.berkeley.edu. It runs on various UNIX 


platforms (Sun. DEC, IBM, HP. and SGI), as well 
as PC platforms (SCO, UNIX, and LINUX). The 
source code uses about 23 Mb of disk space to 
compile, and it runs on a PC with 8 Mb of RAM. 

We found that programming in TCL-TK was 
preferable to developing GUI’s using Motif or X 1 1 
library functions in C programs. The programmer 
can write scripts that are about 1 0 percent of the 
size of the C programs, with a similar order of 
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magnitude in CPU execution time savings. Public 
domain scripts written in TCL-TK can be used to 
build TCL-TK applications that use a mouse-driven 
selection mechanism. This capability allows users 
who do not know the command set to create their 
own interfaces without the overhead of learning the 
script language. 

An example of the TCL-TK interface is dis- 
played in Figure 4. The interface has a button 
orientation similar to many menu-driven software 
packages available today. Buttons can be pro- 
grammed to execute processes or execute display 
routines, open other windows, toggle options, or 
select values for options. The TCL-TK interface to 
the RTM algorithm allows the user to visualize 
model results as it is executing. This is accom- 
plished by setting up two named pipes (or pro- 
cesses). The first pipe is connected to the RTM 
algorithm and, as the algorithm is executing, it 
writes images to disk files as they are processed 
during each iteration. The TCL-TK interface 
oversees this process. When the files are produced, 
the necessary commands are written to the second 
pipe, an IDL process, which then reads and displays 
the images as they are produced. 

The TCL-TK interface sets up the command 
line statement for execution, as described above, 
and performs the execution upon selection of the 
process button. The TCL-TK interface provides an 
easy way for users to select the parameters to use as 
input to the model. A user can use widgets to select 
various options and manipulate slide bars to choose 
values for those options. Examples of all of these 
modes are displayed in Figure 4. In the upper left 
comer is the execution button Execute RTM. Selec- 
tion of this widget, with the mouse, starts the 
execution of the RTM after all other options are set. 
Directly below this button, the Processing Options 
button opens the Processing Output Options win- 


dow displayed in the lower right comer of Figure 4. 
In this window, up to six of the listed parameters 
can be selected to be written out and hourly prod- 
ucts displayed during processing (as in Figure 3). 
When the options are selected, the checkboxes 
immediately to the left of the names are highlighted. 
The Processing Machine options allow the user to 
select the computer with which to execute the RTM. 
In the upper left window panel is a button for Demo 
Mode, in which the model is not executed but 
instead only the IDL display routines are performed. 
The Demo Mode is a method for easy review of 
previously processed data layers. 

Below this are three slider selectors; the first two 
produce inputs for the command line options to the 
RTM, and the third produces input for the five “Dis- 
play ...” buttons below it. The two images in Figure 4 
are examples of these “Display ...” buttons. The lower 
left image is the daily integration of downwelling 
shortwave over the ARM-SGP site, and the upper left 
image displays the results of another RTM to a global 
data set. The Display Lat, Lon and Value button 
executes an IDL command that opens another window 
and displays the associated latitude, longitude, and 
image values for the location in the image over which 
the cursor is located. 

Table 1 shows an example of the TCL-TK code 
used to produce the Execute RTM button, which 
illustrates its similarity to UNIX shell programming. 

4.3 Visualization System 

In developing the software involved in calculat- 
ing the surface solar radiation flux, we have 
produced an environment whereby we can easily 
monitor and modify the data processing as re- 
quired. Occasionally, the images that are re- 
ceived over the network contain missing or 
corrupted data. With our visualization system. 


Table 1. 


button .RTM -text “Execute RTM" -command RTM 

# define RTM button 

proc RTM { mode year doy options} { 

# procedure to run the RTM 

if l ($mode== “Demo”) } { set options “Sopti ons-D” } 

# set option value for Demo mode 

set $Sh _pipe [open {\/bin/sh } w+ ] 

# open pipe to shell 

set command “ \” cd $process_dir\; RTM -Y $year -D $doy $options\" “ 

# set value of command string 

puts $Sh _pipe $command 

# print command to shell pipe 

Display _hourlies 

} 

# execute display routine 
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these defective images can be visually detected and 
corrected, if feasible, or eliminated from the pro- 
cessing stream. As the model produces results for 
the intermediate data steps, they are displayed in a 
window (as in Figure 3) until the next iteration of 
the model is completed. The image then is read in 
as a frame for the animation routine, and the next 


iteration’s set of selected images is displayed. When 
the model finishes executing, the images that have 
been saved in the buffer are sequenced together 
automatically for review by the user. We use this 
system to play back the series of images that the RTM 
algorithm produces over the course of its execution. 
The animation is useful for examining the images in a 
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Figure 4. Visualization system TCL-TK interface ( upper left and lower right), GOES visible channel over the ARM-SGP site ' lower 
left), and a global application of another radioactive transfer model. 
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time sequence that can reveal error patterns not usually 
seen in static image evaluation. For example, upon 
examining some of the first outputs of the RTM 
algorithm, we observed dark patches in the surface 
albedo data layers. When these were sequenced 
together, a pattern developed that was recognized as 
cloud shadows moving across the images. We have 
since developed a routine for removing these effects 
from the surface albedo map. 

The TCL-TK interface can be programmed to 
open other windows with multiple options. This 
promotes the convenient organization of display 
routines that are developed in the IDL language and 
used to display various inputs or outputs during 
model development. Menus are opened, and a 
number of display routines or other processing 
programs and their associated option selection 
widgets can be organized for easy execution. This 
is effective during the development stage of the 
model and the daily processing of the data streams 
received over the network. 

5, CONCLUSIONS AND FUTURE PLANS 

The TCL-TK programming language was easy 
to use for developing complex packaging of display 
routines. This capability allowed us to design a 
convenient method for displaying our RTM algo- 
rithm results during execution and provided a 
means for interactively monitoring our RTM 
algorithm as it is developed. The results reported 
here are for limited size images (400 x 700 pixels), 
but we are in the process of developing a similar 
system for global images produced by the Interna- 
tional Satellite Cloud Climatology Project (ISCCP) 
[Shiffer and Rossow, 1983]. This system will 
compute the surface solar flux, photosynthetically 
available radiation (PAR), and ultraviolet A and B 
for global images. 
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Visualization of Stratospheric 
Ozone Depletion and. the 
Polar Vortex 


LLOYD A. TREINISH 

IBM Thomas J. Watson Research Center, Yorktown 
Heights, NY 

Direct analysis of spacecraft observations of strato- 
spheric ozone yields information about the morphol- 
ogy of annual austral depletion. Visual correlation of 
ozone with other atmospheric data illustrates the diur- 
nal dynamics of the polar vortex and contributions 
from the upper troposphere, including the formation 
and breakup of the depletion region each spring. These 
data require care in their presentation to minimize the 
introduction ofvisualization artifacts that are errone- 
ously interpreted as data features. Nongeographically 
registered data of differing mesh structures can be 
visually correlated via cartographic warping of base 
geometries without interpolation. Because this ap- 
proach is independent of the realization technique, it 
provides a framework for experimenting with many 
visualization strategies. This methodology preserves 
the fidelity of the original data sets in a coordinate 
system suitable for three-dimensional, dynamic exami- 
nation of atmospheric phenomena. 


1. INTRODUCTION 

In the Earth and space sciences, it is very 
common to organize geographically located data as 
a rectilinear grid with horizontal extent over the 
entire surface of the Earth (i.e., latitude and longi- 
tude). In two dimensions, this implies a topological 
primitive that is a rectangle of various sizes. In 
three dimensions, the cell is a parallelepiped, with 
the height corresponding to altitude or atmospheric 
pressure, for example. These rectilinear mesh 
structures are ill-suited for the study of phenomena 
that occur continuously over a nominally spherical 
surface (i.e., they tear the data). In addition, these 
grids may not be fully populated because of missing 
data (such as when observations could not be 
made). In such cases, the grids could be viewed as 
being irregular. 

Independent of grid type, cartographic tech- 
niques are often introduced to suitably deform the 
data to compensate for the problems inherent in the 
original structure (i.e., the use of a rectilinear 
representation for a spherical surface). Tradition- 
ally, such a transformation is accomplished by 
defining a new Cartesian grid in the cartographic 
projection coordinate system and then interpolating 
from the original rectilinear grid to the new one 
prior to any other operation. Given the curvilinear 
nature of such transformations, nonlinear interpola- 
tion techniques are typically required to make the 
transformation of acceptable quality. Figure 1 
illustrates this operation schematically, in which the 
data values at three points in the original rectilinear, 
regular grid are combined to define the value at a 
single point in transformed space (e.g., the values 
are weighted averaged, where the weight is a 
function of distance from their original points to 
that in transformed space). In addition to being 
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Regular Grid Regular Grid 

in Transformed 
Space 

Figure 1. Data transformation by interpolation. 

computationally expensive, such interpolation 
makes it impossible to preserve the fidelity of the 
data prior to rendering, especially if regions of no 
data or other discontinuities are present. Such 
discontinuities would be smoothed out. 

Alternatively, by warping the underlying 
mesh structure, the geometry itself is transformed 
without affecting the data. Figure 2 illustrates this 
operation schematically in which there is a one- 
to-one mapping of each point in the original and 
the transformed grid. At each node in the de- 
formed grid, there is a data value that corresponds 
to a specific node in the regular grid. 



Figure 2. Grid transformation by coordinate warping. 

Thus, any realization (the application of one or 
more visualization strategies that generate render- 
able geometry from a collection of data) is indepen- 
dent of the choice of a specific cartographic coordi- 
nate system, the data or mesh themselves, or how 
the data are specified with respect to the underlying 
mesh (e.g., assigned at each node or to an entire cell 
at its center). In addition, interpolation is not 
required as the initial operation to be applied to the 
data to be visually correlated. Instead, interpolation 
can be isolated to be the last step in the visualiza- 
tion process, namely rendering (e.g., Gouraud- 
shaded surfaces). 

The use of appropriately warped curvilinear 


grids can preserve the fidelity of the data prior to 
rendering. This requires an environment that 
supports direct realization and rendering of data on 
curvilinear grids as well as regular ones. This must 
be coupled with the ability to independently ma- 
nipulate data and its underlying mesh structure or 
base geometry. In addition, the ability to simulta- 
neously render disparate geometry (such as points, 
lines, surfaces, and volumes of varying color and 
opacity) is helpful in viewing the realization of 
multiple data sets. 

Correlative visualization implies two tenets. 
The first is the capability to look at multiple sets of 
data in exactly the same fashion (visual comparison 
within a common framework)-— that is, displaying 
different sets of data of disparate structure in an 
arbitrary geographic coordinate system independent 
of the data sets. The second is the capability to use 
a variety of visualization strategies, within the 
chosen coordinate system, for examining a single 
set of parameters from one source or many param- 
eters from multiple sources. Specific representation 
techniques illustrate different aspects of data. 
Hence, no single method is always suitable, and not 
all techniques are useful for all data sets. 

Therefore, methods that support the registration 
of multiple data sets in geographic coordinates, 
using similar cartographic warping of the respective 
data locations for the data sets in question, show 
promise as an alternative to meshing, interpolating, 
or resampling the original grids of each data set to a 
common rectilinear grid in projected space for 
correlative visualization in the Earth and space 
sciences. The former is used via the IBM Visualiza- 
tion Data Explorer developed by Visualization 
Systems, IBM Thomas J. Watson Research Center 
[Lucas et ah, 1992], This is in contrast to the latter 
approach used by the author via the NSSDC Graph- 
ics System, developed by NASA Goddard Space 
Flight Center [Treinish, 1989; Treinish and 
Goettsche, 1991]. 

2. STRATOSPHERIC OZONE DEPLETION 

AND THE POLAR VORTEX 

The aforementioned approach for the analysis 
of multiple data sets via correlative visualization 
can be illustrated by an example scientific prob- 
lem. There is a phenomenon known as the polar 
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vortex that occurs in the Earth’ s upper atmosphere 
(primarily the stratosphere) above Antarctica during 
the winter and early spring of every year [Schoeberl 
and Hartmann, 1991]. This effect is characterized 
by a cyclonic circulation pattern around the South 
Pole. Many researchers believe that ozone-destroy- 
ing chemicals are trapped in this vortex during the 
cold and dark of Antarctic winter. Once spring 
begins and the polar region emerges from the long 
night, it is theorized that these substances react 
photochemically with ozone to break the molecule 
apart and thus aid in the creation of the so-called 
Antarctic ozone hole. Hence, in late winter, regions 
of ozone depletion around the pole begin to form. 
Within a few weeks the ozone hole is completely 
established. By late spring the vortex weakens, 
causing the ozone depletion region to fragment and 
eventually dissipate. The questions of interest are: 
what are the characteristics of the south polar 
vortex that can be derived from diurnal observa- 
tions of atmospheric dynamics, and how do they 
relate to independent measurements of ozone? The 
study of the appropriate data sets for the Southern 
Hemisphere winter and spring (June through 
December) are relevant. The examination of a 
single year, 1 987, is made because that year showed 
the greatest amount of ozone depletion until recent 
years [Krueger etal., 1992], 

2 . 1 Total Column Ozone Data 

Perhaps the most critical effort to study strato- 
spheric ozone has been via observations made by 
the Total Ozone Mapping Spectrometer (TOMS) 
aboard NASA’ s Nimbus-7 spacecraft. Nimbus-7 is 
in a (polar) sun-synchronous orbit, which means 
that it can roughly provide global coverage of the 
Earth for its suite of instruments once per day. Each 
portion of the Earth was observed nominally under 
the same illumination conditions from day to day. 
Measurements made by Nimbus-7 TOMS show the 
daily global distribution of stratospheric ozone from 
late 1978 until early May 1993. It measured the 
total column density of stratospheric ozone by 
observing back-scattered solar ultraviolet radiation 
in seven spectral bands. Approximately 200,000 
such measurements were made each day, which 
covered the entire globe [Fleig et al., 1986]. 

TOMS required sunlight to operate. Hence, 


there are periods of missing data because of local 
polar winters (darkness) in addition to the usual 
data dropout problems associated with spacecraft 
observations. These regions are visible as gaps in 
various realizations of the data. They are not the 
ozone hole. The data have been gridded in a regular 
lattice of 180 (1.0° in latitude) x 288 (1 .25° in 
longitude) from the raw observations for daily 
global coverage with cells, without data being 
flagged. The locations of missing cells are consid- 
ered in all realizations. The total ozone measure- 
ments are in Dobson units (DU’ s), corresponding to 
a column density of 2.69 x 10 16 molecules of ozone 
cm- 2 . 

Figures 3a and b show traditional two-dimen- 
sional visualizations of the ozone data on Octo- 
ber 1, 1987, which is during the ozone depletion 
season. The rectangular presentation of the data is 
consistent with the provided mesh in that it is torn 
at the poles and at a nominal International Date 
Line. This cartographic representation of the Earth 
is known as a cylindrical equidistant or plate carre 
projection. The ozone data are overlaid with a map 
of world coastlines and national boundaries. In 
Figure 3a, the data are realized as iso-contcur lines 
at 50 DU intervals, which indicate the spatial 
distribution of discrete thresholds within continuous 
data. In Figure 3b, a pseudo-color spectrum is used, 
which is linearly mapped over a range ( 1 1 0 to 650 
DU) that is valid for the year of study (to provide 
consistent comparisons between single days or for 
animation), and it should provide a continuous 
representation of continuous data. Figure 3b also 
has a fiducial overlay (lines of latitude or parallels 
and longitude or meridians at 30° spacing) in white, 
which have been registered in this same rectilinear 
coordinate system. The grid cells where there are no 
data are visible as gaps in the pseudo-color realiza- 
tion. The area of low ozone is visible as a bluish 
band stretched across the bottom of the pseudo- 
colored rectangle. 

Non-rectilinear cartographic projections are 
used relatively often by Earth scientists as a way of 
preserving area in a display of the entire globe 
compared to the cylindrical equidistant projection. 
Other projections may preserve shape or linear 
distance, for example, on selected portions of the 
globe [Pearson, 1990]. 

To examine a continuous phenomenon with a 
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Total Column Ozone (Dobson Units) — October 1, 1987 



Figure 3a. Contour-mapped global column ozone on October 1, 1987. 



Figure 3b. Pseudo-color-mapped global column ozone on October 1, 1987. 


central focus far from the Equator and the Prime 
Meridian, such as the ozone hole, a non-cylindrical 
projection is required. Figure 4 illustrates the same 
data as in Figure 3, except polar orthographic 
projections for both the Southern and Northern 
Hemispheres are employed, which are shown in 
the left and right side, respectively, of the figure. 
For the orthographic projection, all meridians are 
straight lines radiating from the central pole. The 


parallels are concentric circles, which become 
compressed toward the Equator. The orthographic 
projection can be characterized by 

x = R cos ( cp ) cos (9) (1) 

y = R cos ( cp ) sin (9) 

where [cp, 0] represents the location of each node 
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Total Column Ozone (Dobson Units) 


October 1, 1987 


Figure 4. Polar orthographic-warped pseudo-colored deformed surfaces of total ozone. 


on the Earth’s surface in the original mesh as 
[latitude, longitude] , R is a scaling radius 
(e.g., 90°) and [x, y] represents the location of 
each node in the deformed, curvilinear (ortho- 
graphic) coordinate system with an assumed pole 
point of 90° north latitude and 0° east longitude and 
an assumed pole point of 90° south latitude and 0° 
east longitude for the Northern and Southern 
Hemispheres, respectively. 

In addition to the pseudo-color spectrum, this 
two-dimensional cartographic projection of the data 
is extended by redundant realization as a deformed 
surface (i.e., both height and color correspond to 
ozone density). The bluish contiguous area over 
Antarctica clearly depicts the depletion region, 
illustrating the advantage of choosing an appropri- 
ate cartographic coordinate system. The height 
mapping clearly dramatizes the concept of a depres- 
sion in the ozone layer, while the color enhances 
this perception as color enhances a topographic 
map. It provides a continuous representation 
consistent with the spatial structure of the data and, 
in animation, presents the dynamics in an easily 


discernible fashion. 

The use of hue-based pseudo-color mapping for 
realization and rendering of data as images can 
create problems in interpretation because of how 
the human visual system responds to color 
[Rogowitz et al., 1992], For example, discontinui- 
ties appear in the pseudo-color representation, such 
as in Figure 3b, that are not in the data but are 
caused by how humans perceive hue (i.e., dis- 
cretely). However, such pseudo-color maps are 
virtually standard in many disciplines. The accep- 
tance of alternate, perceptually correct pseudo-color 
maps (e.g., luminance-based [Lefkowitz and 
Herman, 1 992]) would be limited in these disci- 
plines because of their unfamiliarity. Therefore, the 
introduction of redundant realization techniques, 
such as surface deformation, retains the familiar 
pseudo-color scale but helps lessen its negative 
perceptual impact. 

Below each translucent surface is a hemi- 
spherical map that has been registered in the same 
orthographic coordinate system as the ozone data. 
This map consists of a monochromatic topographic 
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surface, which is deformed based upon height 
above or below sea level. The monochromatic scale 
is chosen to impart the appearance of a topographic 
map (e.g., the oceans are dark gray) for each 
hemisphere. This surface is created from a topo- 
graphic data base on a rectilinear grid at 0.5° 
resolution. The grid is warped by the same ortho- 
graphic projection that was applied to the ozone 
data, although the data were originally in a different 
coordinate system at different resolution. Both the 
ozone and topographic surfaces are Gouraud 
shaded. In addition, the topographic map is overlaid 
with the same coastline map in magenta, with 
political boundaries corresponding to each hemi- 
sphere, that was used in Figures 3a and b. The map 
geometry was transformed in a manner similar to 
that of the ozone and topographic data. 

There is a seam in each of the orthographic 
surfaces corresponding to where east longitude is 
either - 1 80° or +1 80°, nominally the International 
Date Line, which is an artifact of the warping of the 
original rectilinear data onto a continuous surface, 
welding the discontinuity in the provided form of 
the data. The use of coordinate warping does 
preserve this inherent discontinuity in the data, 
which would not be the case if traditional interpola- 
tion techniques were chosen. In general, this seam 
will not smoothly connect the surface because of 
how TOMS gathers data. This scanning instrument 
examined each portion of the Earth at a different 
time of day, still covering the entire globe once per 
day. Hence, observations on each side of that line 
were taken approximately 24 hours apart and 
usually are not the same. 

2.2 Dynamics Data: Atmospheric Temperature and 

Winds 

Global atmospheric dynamics data (such as 
temperature and wind velocity) are often derived 
from spacecraft, balloon, and aircraft observations, 
which have been modeled and gridded on a 2.5° 
grid, 144 x 73 cells (longitude x latitude) at differ- 
ent levels in the atmosphere, based on their pres- 
sure. Hence, a two-dimensional slice of these data 
at a specific pressure level is organized in a torn 
mesh similar to that of the total column ozone, but 
at lower resolution and in a different geographic 
coordinate system. These data may also have gaps 


in coverage, including only a partial value for some 
wind cells (i.e., one or two of the three vector 
components are missing). 

If this cartographic theme is extended to three 
dimensions by performing a Cartesian to spherical 
coordinates transformation , then for a “spherical” 
projection, all meridians are great circles converg- 
ing at the poles. The parallels are also great circles, 
which become compressed toward the poles. The 
spherical projection can be characterized by 

x = (h + r ) cos (cp) sin (0) (2) 

y = -(h + r) cos (<p) cos (0) 
z = (h + r) sin(tp) 

where [cp, 0, r] represents the location of each 
node on the Earth ’ s surface as [latitude, 
longitude] at its radial distance [r] from the 
Earth’s center in the original mesh, h is the height 
above the surface of the Earth, and [x, y, z] 
represents the location of each node in the de- 
formed, curvilinear (spherical) coordinate system. 

For the data being examined, there are seven 
pressure levels (1,000, 850, 700, 500, 300, 200 and 
100 millibar [mb]). The data are treated as a true 
volume by warping the parallelepiped mesh repre- 
sentation of the atmosphere into a collection of 
concentric spherical shells that compose a volume 
of 73,584 nodes. The maps used in Figures 3 and 4 
are replaced by a topographic globe such that all 
data are registered in a common, spherical, Earth- 
centered coordinate system. The radius of the globe 
is chosen to be the same as the inner radius of the 
spherical shell corresponding to the 1 ,000-mb level. 

Figure 5 illustrates these ideas for both the 
temperature and wind data on October 1, 1987. For 
the temperature data, surface extraction techniques 
are employed because direct-volume rendering 
shows little quantitative information. The tempera- 
ture data are realized as pseudo-color and opacity- 
mapped isothermal surfaces. The isosurfaces are at 
194 and 294 K with the higher value (closer to the 
Earth’s surface) being more opaque. The pseudo- 
color spectrum on the left corresponds to that of the 
temperature isosurfaces. The wind data are 
realized via streamlines generated by numerically 
integrating the paths taken by injecting massless 
particles in the wind field at 150 points uniformly 
distributed within the volume. The lines are 
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Figure 5. Isosurfaces of atmospheric temperatures and streamlines of atmospheric wind velocity. 


Global Atmosphere (1000 - 100 mb) on October 1. 1987 
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Figure 6. Temperature surface (194 K) pseudo-colored by pressure height. 
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pseudo-color mapped by horizontal speeds ranging 
from 0 to 80 m/sec. which correspond to the 
pseudo-color spectrum on the right. The vertical 
component of the wind ranges from about -33 to 
+45 mb/sec. 

Further study of the data using these techniques 
shows that there is a region of cold air (around 
200 K) over the Antarctic during this period that is 
concentrated near 1 00 mb in pressure height, and 
that is surrounded by high-speed winds. The shape 
of this cold air mass and its diurnal variation at first 
glance seem to be similar to the depletion region in 
the total column ozone data. The shape of this air 
mass can be seen in Figure 6. Two spherical, wire- 
frame meshes surround a monochromatic globe. 

The outer green mesh of quads is at the 1 00-mb 
pressure level. The inner blue mesh of triangles is at 
the 200-mb pressure level. An isosurface at 194 K is 
shown for October 1 , 1987, which has two compo- 
nents. south polar and equatorial. 

The isosurface is pseudo-colored on a linear 
scale according to the pressure height at which the 
194 K value is determined. The pseudo-color scale 


ranges from blue ( 1 00 mb) to green (200 mb). Regions 
below 200 mb are colored pink. The south polar 
component of the isosurface is mostly at the 1 00-mb 
level. Hence, further study focuses on 1 00-mb data. 

2.3 Correlating Ozone With Temperature and Winds 

Two different approaches to visually correlating 
the ozone and dynamics data for the 1 00-mb level 
are taken applying the concept of coordinate 
warping to achieve geographic registration. The 
1 00-mb data are the same as those used in Figures 5 
and 6; hence, they are on a different grid than that 
of the ozone data. Because this study examines a 
phenomenon that is focused on a polar region and is 
nearly hemispheric in geographic extent, the 
orthographic cartographic projections discussed in 
equation ( 1 ) are used. They are illustrated using 
data from October 1, 1987, in an attempt to show 
the formation of the polar vortex and the ozone hole 
itsel in Figures 7 and 8. 

Figure 7 shows four separate and different data- 
driven representations of the atmosphere over the 



Figure 7 . Southern Hemisphere Orthographic Warped Atmospheric Data on October 1. 1987. 
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Southern Hemisphere in the same geographic 
coordinate system using the same three redundant 
realization techniques: pseudo-color mapping, 
surface deformation, and iso-contouring. This is the 
same approach as used for the ozone data in Figure 
4, except for the addition of contour lines. In the 
upper left is the ozone column density with con- 
tours every 50 DU, from 100 to 650 DU. The upper 
right shows the 100-mb temperature with contours 
every 5 K, from 180 to 245 K. The lower left shows 
1 00-mb horizontal wind speed with contours every 
10 m/sec, from 0 to 80 m/sec. Cells where one or 
more components of the wind velocity are missing 
are shown as gaps in this surface. 

In an attempt to show the correlation among 
these observable quantities in data space, a simple 
linear model is constructed such that 

M = C 0 0 + C t T + C v lv| (3) 

where O is the normalized total column ozone 
ranging from 0 to 1 (scaled for the dynamic range 
of 1 10 to 650 DU); T is the normalized 100-mb 
temperature ranging from 0 to 1 (scaled for the 
dynamic range of 1 80 to 245 K); I v | is the nor- 
malized 100-mb horizontal wind speed ranging 
from 0 to 1 (scaled for the dynamic range of 0 to 
80 m/sec); M is a scalar field representing a unitless 
linear combination of the three parameters ranging 
from 0 to 3; and C 0 , C T , and C v are normalized 
weighting factors, which are set to 1 . 

The ozone data are bilinearly interpolated to the 
grid on which the 100-mb data are available prior to 
the normalization. Independent gaps in both the 
ozone and wind measurements are properly main- 
tained in the computation of M, which is shown in 
the lower right with contours every 0.5, from 0.0 to 
3.0. Each of the surfaces shows a similar struc- 
ture — a depression of comparable shape and areal 
extent over Antarctica for low ozone, temperature, 
and wind speed, respectively, each with a boundary 
corresponding to that of the polar vortex. The 
relative contribution of each of these parameters to 
the depression structure for M can be examined by 
interactively adjusting the weighting factors (C 0 , 

C T , and Cy) in the computation of M. 

Figure 8 combines each of the parameters into 
one visual object. As with Figure 4, both hemi- 


spheres are shown in the left and right sides, 
respectively, of the figure. The data are stacked 
vertically and shown with topographic, coastline, 
and national boundary maps. The difference be- 
tween Figures 8 and 4 are representations for the 
100-mb horizontal wind velocity and temperature 
stacked between that of the ozone and the maps. 
Below the ozone surfaces are plates of vector 
arrows representing horizontal winds, whose 
directions correspond to the direction of the wind 
and size and color correspond to wind speed, 
ranging from 0 to 80 m/sec. 

Below the winds and above the maps are flat, 
translucent planes corresponding to the 100-mb 
temperature realized as pseudo-color-mapped, filled 
isothermal contours every 5 K over the range of 
180 K to 245 K. 

A daily animation sequence from June through 
September shows the availability of polar ozone 
data as well as the formation of the depletion region 
in both presentations. A similar signature is visible 
as a blue region in the temperature data, before the 
ozone depletion occurs, as a cold air mass forms 
over Antarctica in the winter and persists into 
spring. An analogous but less well-defined shape 
forms in the wind realization. With the onset of 
spring, ozone observations become available and 
form a similar depression pattern to that of the 
temperature and wind. During this period, daily 
animation shows a clear rotation of the depletion 
region surrounded by the boundary of the polar 
vortex. 

This rotation, which has a period of several 
days, is synchronous between the 100-mb and 
ozone data. The arrangement of the wind velocity 
arrows in Figure 8 evoke a cyclonic pattern corre- 
sponding to the polar vortex, which appears almost 
steady-state in the winter and early spring. By early 
November, the warming of the upper atmosphere 
over Antarctica is obvious with direct correspon- 
dence to the dissipation of the polar vortex in the 
wind data and the breakup of the ozone depletion 
region. 

Although the four time-varying surfaces do 
show the correlation among the ozone and dynam- 
ics data, they are difficult to observe, especially in 
animation. The eye tends to focus on one or two of 
the surfaces only. Therefore, at the cost of obscur- 
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Global Stratospheric Column Ozone Cl 10-650 DU) 
lOOmb Wind (0-80 m/sec) and Temperature (180-245 K): October 1, 1987 



Figure 8. Polar orthographic-warped atmospheric data on October!, 1987. 


ing some of the data, the stacked approach shows 
the synchronous circulation in each data set for the 
Southern Hemisphere during this period at a single 

glance. 

3. IMPLEMENTATION 

The techniques described herein have been 
developed via the IBM Visualization Data Explorer 
(DX), a general-purpose software package for 
scienti fic data visualization and analysis. It employs 
a data-flow-driven client-server execution model 
and is currently available on UNIX workstations 
(Sun. Silicon Graphics, Hewlett-Packard, IBM, and 
Data General), as well as a medium-grain, coherent 
shared memory parallel supercomputer, such as the 
IBM Power Visualization System [Lucas etal., 

1 992]. DX simplified the implementation of carto- 
graphic warping and its simultaneous application to 
disparate data sets for correlative visual display by 
providing an extensible tool kit of polymorphic 
operations that are interoperable and seem typeless 


to the user. This polymorphism is a consequence of 
DX being built on a foundation of a unified data 
model, which describes and provides consistent 
access services for any data that are to be studied, 
independent of shape, rank, type, mesh structure, or 
dependency or aggregation. As a result, regular and 
irregular, structured and unstructured data are 
uniformly supported, as well as regions where data 
are missing. 

The relevance of DX to correlative visualiza- 
tion problems is illustrated with a simple example. 
Although DX has several interfaces, the techniques 
discussed herein utilized visual programming, in 
which each computational task is assigned an icon 
and the flow of control and data are defined by 
connecting the icons as a direct acyclic graph. 
Figure 9 shows a visual program that generates a 
visualization of total column ozone as a radially 
deformed, pseudo-color- and opacity-mapped 
spherical surface surrounding a globe. Figure 9 also 
shows the pseudo-color and opacity map for the 
ozone surface via a color map editor and the 
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Figure 9. Example DX program to visualize global column ozone on October I, 1987. 


resultant image of the ozone data registered with a 
globe. Similar visual programs would show the 
atmospheric temperature around a globe, although 
the data are different in structure. 

The program in Figure 9 has the following key 
operations: 

1 . Import: read data from the disk. 

2. Include: subset data by value and indicate 
invalid data. 

3. Color: assign pseudo-color and opacity maps. 

4. RubberSheet: deform mesh by value. 

5. Sphere: warp mesh onto a sphere. 

6. Normals: compute normals for Gouraud 
shading. 

7. Globe: generate a globe representation of 
the Earth. 

8. Collect: aggregate inputs into a single entity. 


9. Image: render an image and provide direct 
interaction. 

For direct rendering of volumes, the 
RubberSheet operation would be eliminated. For 
extraction of surfaces from volumes, the 
RubberSheet module would be replaced with 
Isosurface. For a pseudo-color image (as in Figure 
3b), steps 1,2, and 3 would be used, optionally with 
a cartographic projection (e.g., Mollweide) and then 
Image (step 9). The Sphere and Globe operations 
are implemented as macros (visual programs 
without custom coding). The Globe operation 
uses steps 1, 3, 5, and 6 (with an option for 4) 
applied to topography data. The Sequencer opera- 
tion is for the sequence control used as a virtual 
VCR for specifying animation. The Select operation 
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identifies a member of time series. The Colormap 
operation is for the color map editor used for the 
mapping of data values to hue, saturation, value, 

and opacity. 

4. SUMMATION AND FUTURE WORK 

The application of cartographic warping to the 
correlation of global atmospheric data sets yields 
visualizations that illustrate a simple notion about 
the possible relationship between temperature and 
winds, as well as their contribution below the 
tropopause to the formation of the polar vortex and 
ozone depletion. The use of interpolation prior to 
realization is unnecessary for the visualization of 
gridded data in a system with a sufficiently robust 
infrastructure of data structure and geometric 
support. 

The ideas introduced with the analysis of 
observational data related to ozone and tropospheric 
dynamics can be extended by considering the 
correlation between these same ozone data and 
objective analyses that include the entire tropo- 
sphere and stratosphere as a continuum. Visual 
correlation of these data is also useful for the 
examination of potential depletion regions in the 
Northern Hemisphere or the dynamics conditions 
for their possible formation. Data reflecting the 
typically disturbed conditions in northern polar 
regions in mid- winter through spring relative to the 
Southern Hemisphere yield more complex realiza- 
tions than is the case for comparable southern polar 
regions. Although these data require more care in 
their presentation, an approach similar to that used 
for the Southern Hemisphere still applies. In 
addition, comparison of these data with spacecraft 
observations of clouds may yield additional insight 
into the dynamics of ozone depletion because polar 
stratospheric clouds are believed to provide sites for 
conversion of ozone-destroying chemicals from 
inactive forms to highly reactive ones during the 
darkness of polar winter [Hamill and Toon, 1991]. 
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The LinkedWindows Interactive Data System (LinkWinds) 
is a prototype visual data exploration system resulting 
from a NASA Jet Propulsion Laboratory (JPL) program 
of research into the application of graphical methods for 
rapidly accessing, displaying, and analyzing large mul- 
tivariate multidisciplinary data sets. Running under 
UNIX, it is an integrated multi-application executing 
environment using a data-linking paradigm to dynami- 
cally interconnect and control multiple windows contain- 
ing a variety of displays and manipulators. This para- 
digm, resulting in a system similar to a graphical spread- 
sheet, is not only a powerful method for organizing large 
amounts of data for analysis, but leads to a highly 
intuitive, easy-to-leam user interface. It provides great 
flexibility in rapidly interacting with large masses of 
complex data to detect trends, correlations, and anoma- 
lies. The system, containing an expanding suite of non- 
domain-specific applications, provides for the ingestion 
of a variety of data base formats and hard-copy output of 
all displays. Remote networked workstations running 
LinkWinds may be interconnected, providing a multi- 
user science environment ( MUSE) for collaborative data 
exploration by a distributed science team. The system is 
being developed in close collaboration with investigators 
in a variety of science disciplines using both archived and 
real-time data. It is currently being used to support the 
Microwave Limb Sounder (MLS) in orbit aboard the 
Upper Atmosphere Research Staellite (UARS). This 
paper describes the application of LinkWinds to this data 
to rapidly detect features, such as the ozone hole configu- 
ration, and to analyze correlations between chemical 
constituents of the atmosphere. 


1. INTRODUCTION 

Recent advances in remote sensing capabilities 
and computational power are providing unprec- 
edented ability to study our world. These improve- 
ments are also producing an ever-increasing flood 
of data that must be gathered, transported, stored, 
and analyzed to be fully utilized. This paper reports 
on a system capable of facilitating the rapid visual 
analysis of large masses of data, and discusses its 
application to upper atmospheric science. This 
system grew out of a NASA project at the Jet 
Propulsion Laboratory to study the application of 
computer graphics to the problems of quickly and 
interactively exploring and analyzing very large 
amounts of scientific data. The objectives of the 
program are to ( 1 ) develop a software environment 
that will support the rapid prototyping of visual data 
analysis applications, while at the same time 
maintaining the high level of performance neces- 
sary for interactively manipulating graphical 
displays; (2) develop a user interface that is truly 
intuitive, allowing quick access to the software for 
the novice as well as the advanced user; (3) provide 
a suite of sample applications that are useful across 
a variety of scientific disciplines; and (4) provide 
tools to support user development of applications 
for this environment. 

2. LINKWINDS 


The Linked Windows Interactive Data System, 
or LinkWinds, is a prototype product of this re- 
search effort. In compliance with the research 
objectives, LinkWinds, an integrated multi-applica- 
tion execution environment with a full graphical 
user interface (GUI), is a visual data analysis and 
exploration system designed to rapidly and interac- 
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tively investigate large multivariate and multidisci- 
plinary data sets to detect trends, correlations, and 
anomalies. The system, operating under UNIX, is 
based on an object-oriented programming model 
and is implemented in the C language. It draws 
upon the Silicon Graphics, Inc. (SGI), GL library 
for its GUI and graphics support software and 
presently runs only on workstations supporting this 
library. This includes all SGI workstations and 
those of other manufacturers that have licensed and 
support the GL library. Third-party software and 
the advent of Open GL as a standard graphics 
library will greatly increase the portability of 
LinkWinds in the near future. 

Data sets and individual tools for display or 
control of the data are coded as objects, each 
occupying a window on the LinkWinds screen and 
communicating with other objects through a mes- 
sage-passing protocol. The objects or windows can 
be linked or unlinked at the discretion of the user. 
Linking the windows sets up one-way message 
paths between objects. This data linking paradigm 
makes the system perform much like a graphics 
spreadsheet and, as in a spreadsheet, is a powerful 
way of organizing the data for analysis while 
providing a natural and intuitive interface. Data- 
linking, and its user interface implications, are 
discussed below. 

Messages generated by LinkWinds objects are 
recorded as program statements in an underlying 
language called Lynx. The message-passing 
characteristics are the basis for two key LinkWinds 
functions. The first is the maintenance of an 
internal journal of all user-originated commands 
executed by the environment. This file can be 
saved at any time through a menu option. The 
record can then be replayed at the initiation of 
subsequent LinkWinds sessions, allowing the user 
to draw on a previous layout of LinkWinds applica- 
tions and links or repeat a full analysis session. 

The second function based on the Lynx mes- 
sage-passing protocol is the multi-user science 
environment (MUSE), which provides a method for 
multiple LinkWinds systems to communicate via 
networks. Using menu options, remotely separated 
users can connect to one another and, by establish- 
ing a telephone voice connection, can cooperatively 
view and manipulate their data. A successful 
connection requires that each user be executing 
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LinkWinds and that each has access to the data sets 
being analyzed. This is normally arranged by 
transporting the data sets prior to the collaborative 
session. Because only the messages are sent, and 
not the actual data, a very low bandwidth is re- 
quired, making for quick and efficient communica- 
tion. The MUSE capability is also used to provide 
tutorials over the network to new users and to allow 
users to demonstrate recommendations for applica- 
tion changes or to point out bugs. 

Hard copy of the LinkWinds displays are 
provided by function keys on the keyboard. Placing 
the cursor in a window and pressing FI will pro- 
duce an image of a window’s contents; pressing F2 
will save the complete window and frame; and F3 
will save the full screen. The figures shown were 
obtained in this manner. 

3. DATA LINKING AND THE USER 

INTERFACE 

In addition to the normal GUI functions provided 
by the windowing environment, dynamic manipulation 
of graphs and images is facilitated through the data- 
linking paradigm. Data linking can be understood in 
the context of a spreadsheet, where cells containing 
numbers are linked to other cells. Formulas are 
associated with each cell, so that when a number 
changes, all cells linked to the changed cell recalculate 
their values. LinkWinds does the same thing, but in a 
graphics environment where the rigid grid structure 
gives way to free form, and a cell can translate, for 
instance, into sliders or large-scale number arrays such 
as images. 

This data-linking paradigm is one of the most 
distinguishing features of LinkWinds; it evolved from 
a desire to create a truly easy-to-leam and intuitive user 
interface. A guiding principle is that users are impa- 
tient and want to get started on productive work as 
quickly as possible. Large manuals only discourage 
them [M. Rettig, 1991]. Therefore, an interface was 
needed that can be learned by exploration and that 
conforms to expectations as the user works. 

Data linking is affected through two icons. The 
link icon is a button displaying two interlocking rings, 
while the unlink icon displays two rings that are 
separated. Objects on the screen may have a single 
link button, the full set of link and unlink buttons, or no 
buttons. The presence of a single link button indicates 
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a data object, while the presence of the pair indicates 
applications with control functions. A window with no 
buttons is an application with only display capabilities. 
To perform a link, the cursor is placed on the appropri- 
ate button, and a “rubber band” is dragged out and 
dropped into the application to be linked. To break the 
link, the same routine is followed using the unlink 
button. The rubber bands signify that the links may be 
either continuously displayed or hidden during the 
session, depending on the user’ s preference. There are 
two simple rules to follow in applying the linking 
paradigm: 

1 . When, as a result of menu selections, an 
empty window appears on the screen, put 
data in it. This is done by linking a data 
object into the window. 

2. When an object with the pair of link sym- 
bols appears, exercise its control function 
by linking it into any application object. 

Our experiences with users has confirmed the 
intuitiveness of this paradigm. Scientists quickly 
become familiar with the manipulations that control 
LinkWinds, often grabbing the mouse out of our 
hands during demonstrations to try themselves. New 
users typically require about 30 minutes of 
demonstration to understand the system well enough 
to embark upon productive work. Recently, groups 
of incoming Caltech graduate students were sent for 
a LinkWinds tutorial, and they quickly were 
performing competently. 

While the LinkWinds scheme of linking together 
objects on the screen is reminiscent of dataflow 
systems, the similarity is only superficial. 

LinkWinds is a multi-application execution environ- 
ment and therefore has significant architectural 
advantages. Any data transformations are done 
within high-level applications, which access the data 
through pointers to shared memory, negating the 
need to store intermediate copies of the data. This 
results in more efficient use of memory and better 
performance. LinkWinds is optimized for execution 
rather than programming and, with no need for 
linguistic completeness, is less complex and there- 
fore more easily learned. 

4. DATA BASE INTERFACE 

The current version of LinkWinds interfaces 
with both archived and real-time data. In the 


archived data mode, the primary data interface is 
with the widely accepted 8-bit raster and scientific 
Hierarchical Data Format (HDF), created and 
supported by the National Center for Supercomput- 
ing Applications (NCSA) at the University of 
Illinois, Urbana-Champaign. The 8-bit raster 
format allows three-dimensional data files to be 
constructed as a sequence of images. LinkWinds 
also accepts data in a byte format, the Silicon 
Graphics, Inc. RGB image format, and the Common 
Data Format (CDF), originated and supported by 
the Goddard Space Flight Center (GSFC). Addi- 
tional data formats are imported using DataHub 
[Handley and Rubin, 1 993], a system being devel- 
oped in close collaboration with the LinkWinds 
effort, and also supported by the NASA Applied 
Information Systems Research Program. DataHub 
provides LinkWinds direct access to a wide variety 
of data formats, including the Airborne Visible and 
Infrared Imaging Spectrometer (AVIRIS) format, 
Planetary Data System (PDS) format, and NetCDF, 
a derivative of CDF developed and supported by 
the National Center for Atmospheric Research 
(NCAR). The two programs are working together 
so that DataHub and LinkWinds may be called from 
inside the other, thus complementing their capabili- 
ties. LinkWinds will use DataHub for additional 
formats and to reduce large data sets, while 
DataHub will use LinkWinds for visual data 
selection, subsetting, and validation. 

Data sets to be ingested by LinkWinds are 
listed in a text file and appear in the top-level “data 
bases” menu. Wild cards may be used in this text 
file, allowing whole classes of data files to be 
accessible. New data files created during a session 
by DataHub or LinkWinds’ subsetting tool may be 
automatically added to the menu. Metadata needed 
to translate data and axis values to meaningful 
numbers, such as the number of axes and their 
names, are obtained from data files in the standard 
formats. A special metadata file exists for the raw 
byte data or to provide additional information not 
contained in the standard formats. 

The real-time mode of LinkWinds is a recent 
development [Jacobson and Berkin, 1993], and it 
must be exercised through the creation of a server 
tailored to the format of the input data stream. The 
data description file contains the name of the server, 
as well as the usual auxiliary metadata information. 
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The server is connected to LinkWinds through 
UNIX sockets. An interrupt system within 
LinkWinds senses the arrival of data and notifies 
the pertinent applications. In addition to the actual 
measured data, clock times and engineering infor- 
mation about the status of the sending devices may 
be read. Incoming data are saved in HDF 8-bit 
format and may then be replayed by requesting a 
replay server in the metadata file. Specific data 
files and time intervals may be chosen for replay. 

As a test demonstration, LinkWinds was used to 
monitor and analyze data from the University of 
Iowa’ s Plasma Wave Subsystem aboard the Galileo 
spacecraft during the Earth 2 encounter in Decem- 
ber 1992. 

5. APPLICATION TO ATMOSPHERIC 

DATA 

A suite of applications useful across many 
disciplines has been developed for the LinkWinds 
environment (see Table 1). Figure 1 shows a 
typical session to explore a data set collected by the 
Microwave Limb Sounder (MLS) currently in orbit 
aboard the Upper Atmosphere Research Satellite 
(UARS) [Waters et al., 1993], These data are three- 
dimensional, with the concentration of various 
chemical constituents varying with latitude, longi- 
tude, and pressure. The instrument has sensitivity 
over the range from 100 millibars (mb) to 0.1 mb. 

A new data set is collected daily; the data shown 
here are from day 56 of the mission, which is day 
3 10 of 1991. 

The LinkWinds top-level menu appears in the 
center left of Figure 1 . From this menu, the data 
bases, tools, and system options are selected. Data 
objects, with their single link buttons, are just above 
the menu. In this case, the data displayed are ozone 
and water vapor. The window, entitled Image 1, 
contains a slice of the ozone data at a pressure of 
21.54 mb, as selected by Slider 1, which is linked to 
it. Slider 1 is also linked to Image2, which shows the 
water vapor at the same pressure. Slider 1 permits 
the user to scan the full data set from the maximum 
to minimum altitudes. The user can also switch to 
any of the three orthogonal axes and similarly scan 
them. The amount of each constituent is given by 
color, as indicated on the color bars at the bottom of 
the images, with red denoting high concentration and 


blue low. Gray indicates missing data, where MLS 
could not measure because of the orbital position, 
orientation, and sensing range of UARS. 

The Southern Hemisphere ozone hole is clearly 
seen at the lower left of Image 1, and a high value of 
the water vapor is seen in the corresponding loca- 
tion of Image2. This anticorrelation is readily 
visible in Scatter 1 , where the points shown come 
from the region defined by a bounding box control 
embedded in Image and linked to Scatterl . This 
box, visible in the lower left of Image 1 , can be 
resized or moved as desired. For every point inside 
this box, Scatterl plots the ozone vertically versus 
water vapor horizontally. In the lower text box of 
Scatterl, statistical quantities associated with the 
scattered data are given. Of particular note is the 
closeness of the correlation coefficient to - 1 , 
indicating high linear anticorrelation. Other infor- 
mation, such as third and fourth moments, linear 
best fit data of the scatter, the number of points 
scattered, and the chi-squared value, may be 
selected from the Scatter menu. The full range of 
statistical information or the points themselves may 
be saved to files via buttons. 

Two other applications display data along one 
dimension. In the lower center of Figure 1, 
LinePlotl displays ozone concentration as a func- 
tion of pressure. The location in latitude and 
longitude of this plot is controlled by Image 1 , 
which has been linked into LinePlotl . The green 
plot corresponds to the frozen crosshair at the 
center of Image 1 , while the white plot corresponds 
to the red crosshair and is instantaneously updated 
as the crosshair is moved. The much lower white 
values reflect the paucity of ozone inside the hole. 
The Profile application to the left of LinePlotl 
shows ozone concentration along the cyan line in 
the slice of Image 1 . This line may be interactively 
updated. The current Profile line starts in the ozone 
hole, goes through an ozone-rich region, and ends 
back in the hole. This is reflected in both the colors 
of Image 1 as well as the height of the curve of 
Profile 1 , providing a different way of visualizing 
the data. 

Because this data set is global, Figure 2 shows 
the ozone data displayed in Globe 1 as both a color 
and height field rendered on a sphere. The ozone 
hole and adjacent regions of higher ozone are 
clearly seen. The water vapor data could have been 
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Figure I. LinkWinds session to explore upper atmospheric ozone and water vapor measured by the Microwave Limb Sounder aboard UARS. 
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2 . Examination of other features of upper atmospheric ozone measured by the Microwave Limb Sounder aboard UARS. 
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used for the height field, clearly demonstrating any 
correlations. Sliderl also sets the pressure slice of 
this display, and the height scale is determined by 
the rotary dial on the left side of the window. Pan- 
Zoom and 2- Axis Rotator controls linked to Globe 1 
allow it to be interactively positioned as desired. 

The Combine tool allows mathematical ma- 
nipulations to be performed on slices of data. A 
calculator called from Combine via a button con- 
tains the standard mathematical functions and 
allows the input of constants, as well as slices. The 
user can combine up to three slices of data, from the 
same set or mixed sets, in any mathematical expres- 
sion desired. In Figure 2, these slices have been 
selected by linking in LinePlot, with the red, green, 
and blue sliders setting the levels as 21 .54, 14.68, 
and 10 mb, respectively. The calculator has 
summed these three slices, and the resulting slice 
and histogram of the data distribution are displayed 
in Combine 1. 

Among the tools in Table 1 , two deserving 
special comment facilitate the creation of anima- 
tions for immediate display on the screen or subse- 
quent recording on video or film. Frame Animator 
is frame based, with the user selecting starting and 
ending control values, and the number of frames 
desired in the animation. The other Animator is 
time based. The user sets any number of control 
positions, each with associated key times graphi- 
cally selected by moving the hands on a clock. The 
Animator then interpolates between these set 
positions to easily make animations with great 
flexibility in the control. The desired frame rate is 
selected from the Animator menu. Rates include 
those for film, video, and a screen display mode, 
which makes the animation in real time. 

6. FUTURE PLANS 

Several developments are planned for the future 
to significantly improve the usefulness of 
LinkWinds. As in the past, we will continue to 
develop applications in collaboration with science 
users seeking to solve real problems. Where 
relevant, these applications will make use of 
modern rendering techniques that can be applied 
successfully in an interactive environment. 

A major impediment to the use of any visual- 
ization tool is the difficulty users have in inputting 


their data. Developments of LinkWinds and 
DataHub will continue to make this process as 
seamless and automatic as possible. A related issue 
is the types of data addressed by LinkWinds. 
Available tools for visual data analysis are gener- 
ally confined to relatively well-behaved and rectan- 
gularly gridded data sets. LinkWinds’ first venture 
from the common mold was into the realm of real- 
time data, creating a capability to ingest such data 
and build interacti ve applications for monitoring 
and analyzing it. 

There are other major neglected categories of 
data that are quite common in scientific research 
and badly in need of tools to support their 
exploration and analysis. Several problem areas to 
be addressed in the future are (1) data sets in which 
there are significant sources of error, either 
statistical and/or systematic; (2) data sets that are 
ungridded samples, either sparse or numerous, from 
which the user desires to construct gridded data sets 
over extended regions; and (3) disparate-sized data 
sets from a variety of instruments that must be 
warped and/or co-registered for overlay or 
comparison. 

In the future, we intend to pursue the develop- 
ment of a users’ applications generator for 
LinkWinds. Currently, the layout of the objects, or 
widgets, in all of the windows is determined by a 
text file. The user can reconfigure these windows 
either by editing this file or interactively from a 
menu-selectable “redesign” mode. We intend to 
expand this tool kit approach to further allow users 
to throw widgets away, or add new widgets from a 
provided catalog. In conjunction, LinkWinds will 
generate a C code source module to make these 
widgets work. This code will be suitable for use as 
a template for the development of a full application. 
As experience is gained with this approach, and a 
planned conversion of the code to C++ is accom- 
plished, we anticipate that the rendering and di splay 
processes will also lend themselves to a limited 
catalog of processes selectable by the user. 

Aware that the only way to develop useful tools 
is in conjunction with research on meaningful 
problems, our policy has been to encourage users 
and potential users to contact us concerning 
LinkWinds’ changes and needs. We have re- 
sponded and will continue to do so to the limit of 
our resources. LinkWinds is currently in use at 
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Table 1. Current Suite of LinkWinds Applications. 



2-Axis Rotator 


Pan-Zoom Slider 


Combine Slider 


Animator 


Frame Animator 



Streamline 


StreamPlane 


JhannelShder 


Control 


Controls which slice of data is displayed along any of three orthogonal axes 


Controls the slice of data displayed along the three orthogonal axes (three sliders 


Rotates 3-D applications along three orthogonal axes (three sliders) 


Moving in a 2-D area, rotates 3-D applications (one slider) 


Controls 3-D applications by changing the viewpoint 


Determines three slices of data and their three constant offsets (six sliders) 


Provides time-based animations of any application with an arbitrary number of key 
frames, with a varietv of frame rates 


Provides frame-based animations of any application, with only the start and stop 
control values being set 


Involves interactive data set palette manipulation allowing color editing, data ranges 
being ramped in color, or substitution of pre-defined palettes 


Display/Control 


Combines up to three slices of data using standard mathematical functions entered 
in an embedded calculator 


Compares the functional behavior of each point in a data set with a reference point 
using a variety of mathematical functions 


Allows the user to interactively save portions of the displayed data into HDF 


Plots the values along a straight line going completely through a data set parallel to 
any axis, and also functions as a slider to select three slices 


Displays the distribution of values in the 256 data channels for up to three slices, and 
provides filtering and color stretching 


Displays a single slice of data or a composite RGB image of three slices with 
embedded crosshair, bounding box, and line controls 


Display 


Polygonally renders an image in perspective relief, with an optional accompanying 
height field, of either a single slice or RGB three-slice composite 


Polygonally renders an image on a globe, with an optional accompanying height 
field, of either a single slice or RGB three-slice composite 


Displays in a 3-D point-by-point rendering all the values in a data set between two 
limits 


Displays the data values along a line drawn on the Image, Combine, or Compare tools 


At every location in a slice, plots the values of one data set against the other to show 
the correlation 


At every location in a slice, plots the values of three data sets in 3-D to show their 
correlation 


Gives numerical information about the data in Image, Combine, or Compare, both 
at a point and averaged over a bounding box 


Real Time 


Display data (through a strip-chart recorder) as either color or line plots, as a 
function of channel number vertically and time horizontally 


Plots data value versus channel number, updating in real time with options of 
saving a spectrum and averaging in time 


Functions much like StreamPlot, but with the data polygonally rendered in relief 
using the data values for height and color 


Provides timing information tor the current data, giving date, day of year, time of 
day, and internal spacecraft time 


ontrols the range of channels to be viewed, allowing concentration on features of 
interest 





















Chapter III: Lower Atmospheric and Earth Science Applications 


183 


more than ten institutions, being applied to prob- 
lems in remote-sensed and field geology, atmo- 
spheric physics and chemistry, meteorology, 
oceanography, chemical spectroscopy, space 
plasmas, genetics, and cellular biology. As it 
evolves, we expect to interact with a wider distribu- 
tion of scientists engaged in research spanning 
additional scientific disciplines. 
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Temperature data derived from the Microwave 
Sounder Unit (MSU) provides an opportunity for 
investigating atmospheric temperatures on a global 
scale since 1979. Fourteen years of global data sets 
of daily temperature anomalies within the lower 
stratosphere and lower troposphere are being gener- 
ated at NASA Marshall Space Flight Center. 

LinkWinds, a visualization/analysis package under 
development at NASA Jet Propulsion Laboratory, 
has been extremely useful for validating and analyz- 
ing these data sets. LinkWinds provides the ability to 
interactively scroll and animate through the 10,220 
images of temporal data, to selectively slice and view 
the data along latitude, longitude, or temporal axes, 
to interactively analyze spatial and temporal vari- 
ability within the data, and to perform correlative 
analysis between various elements of the data. These 
capabilities have been invaluable in allowing the 
recognition of processing artifacts, as well as the 
effects that physical phenomena, such as the El Ninos 
effects and the Mt. Pinatubo eruption, have had on 
atmospheric temperatures. 


1. INTRODUCTION 

The availability of global data sets extending 
over relatively long periods is vital to global change 
research. With regard to atmospheric temperature 
measurements and the unresolved issue of global 
warming, the most temporally extensive data sets 
are, unfortunately, derived from sparse ground 
stations that are often located near rapidly growing 
urban areas or other areas greatly affected by 
localized conditions. Furthermore, surface-based 
temperature measurements in oceanic or other 
remote regions are sparse in both temporal and 
spatial domains. Previous studies have shown that 
the Microwave Sounder Unit (MSU) sensor, located 
on several National Oceanic and Atmospheric 
Administration (NOAA) satellites since 1979, 
provides an excellent measure of atmospheric 
temperature variability for the entire Earth [Spencer 
etal.,1990]. 

Methods have been established for consolidat- 
ing 14 years of data from several MSU sensors and 
for gridding and calculating global atmospheric 
temperature anomalies for the lower troposphere 
and lower stratosphere [Spencer etal., 1991; 
Spencer and Christy, 1992a, 1992b]. Data sets of 
limited temporal resolution and range were gener- 
ated during the previous studies to validate the 
techniques against ground-based and radiosonde 
data. 

At Marshall Space Flight Center (MSFC), daily 
atmospheric temperature anomaly data sets have 
been generated from the MSU sensor data, for the 
1 4-year period extending from January 1 979 to the 
present, for both the lower troposphere and lower 
stratosphere. These data sets are undergoing 
validation and quality control at MSFC, before 
being made available for distribution to global 
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change researchers. Although some validation 
and quality control have been accomplished using 
established data-processing procedures, The Linked 
Windows Interactive Data System (LinkWinds) is 
being used extensively to provide additional quality 
control through interactive visualization and 
analysis of the entire data set. 

2. THE DATA 

The present MSU temperature anomaly data 
sets consist of daily global images for 14 years 
from 1979 to 1992, for both the lower troposphere 
and lower stratosphere. This calculates to 10,220 
images. The temperature anomalies are mapped to 
a 2.5° grid, resulting in an image resolution of 
144 x 72. At present, the data sets are stored in the 
8-bit image Hierarchical Data Format (HDF), with 
1 year (365 to 366 images) of data per HDF file. 
The stratospheric and tropospheric anomalies are 
divided into separate files. Each HDF file is about 
3.8 MB, or more than 100 MB of total data for the 
entire 14-year period. 

The MSU daily temperature anomaly data sets 
are derived from a series of processing steps, each 
of which can be a potential source of error. These 
steps include: 

e Calibration of raw MSU antenna brightness 
data 

8 Retrieval of atmospheric temperature from 
various MSU channels 

• Navigation/coregistration 

• Consolidation of data from multiple sensors 
(i.e., adjusting intersatellite calibration 
differences) 

0 Limb correction (i.e., off-nadir corrections) 

• Calculation of an average daily temperature 
for a given location, based on the entire 

data set 

8 Daily averaging and spatial gridding 

It is also known that several natural factors can 
cause artifacts within the data, including the pres- 
ence of short-lived isolated thunderstorm cells, 
which can cause a bias in the daily average tem- 
perature measurement, and high surface elevations, 
which can result in erroneous tropospheric tempera- 
ture measurements caused by interference from 
ground temperatures. 


3. THE TOOL 

LinkWinds is a visualization/analysis tool 
under development at NASA Jet Propulsion Labo- 
ratory (JPL) [Jacobson, 1992; Jacobson andBerkin, 
1993a, 1993b], LinkWinds provides an intuitive 
and highly interactive interface for data exploration 
and analysis, as well as for intercomparison and 
correlation of multiple data sets. LinkWinds is best 
suited for exploring or comparing “datacubes” or 
“stacked” data, which might consist of two spatial 
dimensions (such as latitude and longitude) and a 
third dimension of either spectral bands, time, a 
collection of variables, or the third spatial dimen- 
sion (such as altitude or depth). However, 
LinkWinds does not restrict the user’ s choice of 
axes, so that any two- or three-dimensional data can 
be analyzed. Furthermore, LinkWinds allows 
correlative analysis between two or more similarly 
configured datacubes, in addition to extensive 
exploration and analysis within a single datacube. 

In the case of the MSU data sets, LinkWinds 
provides the ability to interactively scroll through 
the 10,220 images of temporal data, to selectively 
slice and view the data along latitude, longitude, or 
temporal axes, to interactively view and analyze 
spatial and temporal variability for a given position 
or region of interest, and to combine and correlate 
various elements of the data. Rudimentary data 
base management is accomplished in LinkWinds 
through the use of a user-defined hierarchical menu 
structure. The user is provided a selection of tools 
into which these data can be loaded, including: 

• Image display tools for two-dimensional 
and three-dimensional data display 

• Data feedback tools for line plots, histo- 
gram display, and pixel and bounding box 
tracking 

• Tools for intercomparing data sets, such 
as two-dimensional and three-dimensional 
scatter plots, and compare and combine 
tools 

• Frame-based and time-based animation 
controllers 

• Control widgets, such as sliders, rotators, 
pan-zoom controller, color table editor, and 
data subsetting tool 

Many of the display tools also have associated 
internal controllers that can be linked to other tools. 
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For instance, the histogram tool allows the user to 
perform rudimentary image enhancement within 
image , globe, and plane tools, while line plot 
allows the selection of components for a three-band 
(RGB) composite image. 

Controlling functions can be linked to more than 
one tool, allowing simultaneous control overthe 
analysis and display of two or more data sets. 

Figure 1 shows an example of the MSU data set 
and some of the tools available within LinkWinds, 
including the image, globe, histogram, line plot, 
and combine with calculator tools. 

4. DATA SET VALIDATION AND ANALYSIS 

Although some validation and quality control of 
the data set can be achieved by analytical methods, 
the interactive visual exploration of the data pro- 
vided by LinkWinds has proved invaluable for 
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rapidly spotting erroneous data or artifacts beyond 
those found by analytical means alone. Having the 
ability to very rapidly scroll through a year or more 
of daily images has been useful for locating gross 
errors, as well as errors that may not be apparent on 
single images, but that become noticeable during 
image animation. The investigation of discovered 
artifacts was able to be refined and new sources of 
error discovered using several additional capabili- 
ties of LinkWinds, including in particular the ability 
to slice and scroll the data sets along any orthogo- 
nal axes, to generate line plots along these axes, and 
to retrieve additional feedback through the histo- 
gram and pixel tracking tools. In addition, an 
initial investigation of the relationship among 
various parameters was possible using the 
scatterplot, combine, and compare tools. 

These capabilities have been invaluable in 
allowing the recognition of processing artifacts, as 
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Figure 1. An example of a few of the tools available within LinkWinds, including image, globe, histogram, line plot, and combine 
( with associated image calculator). 


well as the effects that physical phenomena have on 
atmospheric temperatures. The features and artifacts 
observed using LinkWinds included: 

• V alidation/qual ity control 

V alid data ranges/bad pixels 
Surface elevation effects (mountains) 
Misplaced orbits 

Striping caused by improper limb 
correction 

Bug in software where arrays not 
cleared 

Thunderstorm signatures 

• Analysis 

El Ninos effects 

Mount Pinatubo eruption effects 
Quasi-biennial oscillations 
Strong inverse correlation between 
tropospheric and stratospheric anoma- 
lies in summer hemispheres 
The initial stage of validation of the data set 
using LinkWinds was to determine the range of 
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valid temperature anomaly values. The initial range 
of the data extended between about -40 to +40°C. 
LinkWinds provided the ability to interactively 
examine the validity of pixels in the lower and 
upper ranges of the data. As illustrated in Figure 2, 
the color table editor was used to set the color of 
the “bad pixel” value to a very conspicuous color — 
in this case, bright green. Setting the sliders on the 
histogram tool to various upper and lower limits, 
pixels that exceed these values took on the “bad 
pixel” color and could easily be distinguished from 
other pixels. It was then possible to quickly slide 
through the sequences of images and determine at 
what upper and lower values the pixels seemed to 
be valid, rather than spurious, as was the case in 
Figure 2. Using this procedure, it was determined 
that the valid range for these temperature anomaly 
data was between -20 and +20°C for the lower 
troposphere and between -25 and +25°C for the 
lower stratosphere. 
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Figure 2. Example of "bad pixel" values determined using LinkWinds image, histogram, color, and pixel tracking tools. 
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Ground temperature contamination of tropo- 
spheric measurements in areas of high surface 
elevation is difficult to recognize within a single 
image, but is easily recognized by two methods 
within LinkWinds. First, regions of consistent 
spurious values often become recognizable while 
scrolling or animating through the temporal se- 
quences of images. This method is difficult to 
illustrate within a publication. The second method 
takes advantage of the ability to view and scroll 
data along various axes within LinkWinds. As 
illustrated in Figure 3, the image tool can be set to 
view a slice of the data along a latitude-time plane 
by means of a pop-up menu. A slider then can be 
set to scroll through longitude space until spurious 
linear discontinuities are spotted parallel to the time 
axis. In Figure 3, the days run from 1 to 365 along 
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the horizontal axes, and latitude along the vertical 
axis. A discontinuity is shown, which persists 
throughout the year at 20.9S, 66. 7W, and it results 
from the presence of the Andes Mountains. The 5 -day 
periodicity of the microwave signal, as determined 
using the line plot tool, is apparently related to 
changes in the satellite viewing angle from nadir to 
oblique. 

Other artifacts that were discovered in the initial 
MSU data sets included misplaced orbits and 
striping caused by improper limb corrections, shown 
in Figure 4a and Figure 4b, respectively, as well as 
instances where the data array had not been set to 
zero for areas of missing data. In addition, unreason- 
ably high temperature gradients, indicative of the 
presence of local thunderstorms, were discovered 
and corrected. Many of the artifacts in the data sets 
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Figure 3. A latitude-day slice of the MSU tropospheric temperature anomaly data showing the presence of spurious data at 20. 9S, 
66. 7W throughout the year, resulting from the presence of the Andes Mountains. The 5-day period of the artifact, as determined 
with the line plot tool, results from changes in the satellite viewing angle from near-nadir to oblique. 



Figure 4. Additional artifacts discovered in the MSU temperature anomaly data using LinkWinds, including (a) misplaced orbits 
and (h) striping caused by improper limb correction. 


would not have been recognized without the exper- 
tise of an atmospheric scientist, adding credence to 
the need for tools, such as LinkWinds, which are 
developed for and operated by the scientist. 

In addition to recognizing undesirable artifacts, 
LinkWinds has also provided the opportunity to 
easily observe relationships between various natural 
phenomena and global atmospheric temperature 
anomalies. These phenomena include El Nino/ 
southern oscillations (ENSO), quasi-biennial 
oscillations (QBO), volcanic eruptions, and a strong 
inverse correlation between stratospheric and 
tropospheric temperature anomalies within the 
summer hemisphere. Figure 5 shows the tropo- 
spheric warming in the tropics as a result of ENSO 
of 1983. 

In Figure 6, longitude-day slices of the strato- 
sphere, at 1 6.9N latitude, are shown for 1 990, 1 99 1 , 
and 1992. The relative cooling of the lower strato- 
sphere, possibly the indication of a QBO, can be 
seen in the second half of 1990 and the first half of 
1991. As seen in Figure 6, this cooling cycle was 
interrupted by rapid stratospheric warming as a 
result of the Mt. Pinatubo eruption on June 27, 1991 
(day 191). 

An additional relationship that was observed 
with LinkWinds was a strong inverse correlation 
(correlation coefficient up to 0.93) between tem- 
perature anomalies in the lower troposphere and 


those in the lower stratosphere within the summer 
hemisphere. In contrast, the stratospheric and 
tropospheric anomalies in the winter hemisphere 
consistently exhibit no correlation. Using the 
LinkWinds scatterplot tool, as shown in Figure 7, 
one can rapidly scroll through the year’s worth of 
data and observe the daily scatterplot for the entire 
globe or for a user-selected subregion. Within each 
hemisphere, the stratospheric and tropospheric 
MSU temperature anomalies are observed to 
become highly inversely correlated as that 
hemisphere’s summer months are approached, and 
then progressively degenerate to acondition of poor 
correlation as the winter months are approached. 
During the high correlation periods in the summer, 
the slope of the stratosphere/troposphere correlation 
curve is relatively consistent from year to year; it is 
around - 1 .45 ± 0. 1 5 for the Northern Hemisphere 
and -1.35 ±0.1 for the Southern Hemisphere. The 
intercept for the correlation curve varies from about 
0 to 1 .5°C for the Northern Hemisphere and from 
about - 1 .5 to 2.0°C for the Southern Hemisphere. 

5. CONCLUSIONS 

LinkWinds provides an intuitive and highly 
interactive interface and significant capabilities 
for validating and analyzing large data sets. Ad- 
equate quality control of 1 0,220 images within the 
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Figure 5. A latitude-day slice of the strospheric and tropospheric MSU temperature anomalies showing the tropospheric warming 
in the tropics during the El Niho/southern oscillation (ENSO) in the first half of 1983. 



Figure 6. Longitude-day slices at I6.9N latitude for stratospheric anomalies in the years 1990, 1991, and 1992. Note relative 
cooling from mid- 1 990 until mid- 199 1, possibly indicative of a quasi-biennial oscillation (QBO), which was suddenly interrupted 
by stratospheric warming resulting from the Ml. Pinatubo eruption of July 27, 1991. 
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Figure 7. Scatterplot of tropospheric versus stratospheric temperature anomalies during the summer in the Southern Hemisphere. 
Region of interest delineated by the bounding box in the troposphere image. Note correlation coefficient = -0.92 and other 
relevant information provided in the statistics box of the scatterplot tool. 


MSU data set would not have been possible without 
LinkWinds, or a tool of its caliber. The ability to 
interactively compare and combine various param- 
eters within a single tool makes LinkWinds an 
excellent tool for comparative analysis of multisen- 
sor and multiparameter data sets. LinkWinds also 
provides a promising development environment for 
future tools or for extension of the present 
LinkWinds functionality. 

REFERENCES 

Jacobson, A.S., LinkWinds: An Approach to Visual Data 
Analysis, Proceedings oflnt’l Symposium on Spectral 
Science Research, Maui, Hawaii, November 1992. 

Jacobson, A.S. and A.L. Berkin, LinkWinds, A System 
for Interactive Scientific Data Analysis and Visualiza- 
tion, Proceedings of High Performance Computing 
Conference , Soc.for Computer Simulation, 214, 
March 1993a. 

Jacobson, A.S. and A.L. Berkin, LinkWinds, A Visual 
Data Analysis System and its Application to the 
Atmospheric Ozone Depletion Problem, Transactions 
oftheAGU, 1993b. 


Spencer, R.W. and J.R. Christy, Precision and Radio- 
sonde Nalidation of Satellite Gridpoint Temperature 
Anomalies, Part II: A Tropospheric Retrieval and 
Trends During 1979-90, J. Climate, 5, 858-866, 
1992a. 

Spencer, R.W. and J.R. Christy, Precision and Radio- 
sonde Validation of Satellite Gridpoint Temperature 
Anomalies, Part I: MSU Channel 2, J. Climate, 5, 8, 
847-857, 1992b. 

Spencer, R.W. and J.R. Christy, Precision Lower 
Stratospheric Temperature Monitoring With the 
MSU: Validation and Results 1979-91,7. Climate (in 
press), 1993. 

Spencer, R., J. Christy, and N. Grody, Global Atmo- 
spheric Temperature Monitoring with Satellite 
Microwave Measurements: Method and Results 1979- 
84, 7. Climate, 3, 10, 1 1 1 1-1 128, 1990. 

Spencer, R., J. Christy, and N. Grody, Precision Tropo- 
spheric T emperature Monitoringl979-1990, 
Palaeogeography, Palaeoclimatology, 
Palaeoecology,90, 113-120, 1991. 



Interactive Data Exploration 
and Particle Tracking for 
General Circulation Models 


R.I. ROSENBAUM, R.L. PESKIN, S.S. WALTHER 

Center for Computer Aids for Industrial Productivity, 

Rutgers University, New Brunswick, NJ 

H.P. ZINN 

NASA Goddard Institute for Space Studies, New York, NY 

The SCENE environment for interactive visualization of 
complex data sets is discussed. This environment is used 
to create tools for graphical exploration of atmospheric 
flow models. These tools may be extended by the user in 
a seamless manner, so that no programming is required. 

A module for accurately tracing field lines and particle 
trajectories in SCENE is presented. This is used to 
examine the flow field qualitatively with streamlines and 
pathlines and to identify critical points in the velocity 
field. The paper also describes a visualization tool for 
general circulation models on which the primary fea- 
tures of the environment are demonstrated. 
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1. INTRODUCTION 

W e have applied a distributed computing 
interface called SCENE (Scientific Computation 
Environment for Numerical Experimentation) to the 
analysis of atmospheric data sets. SCENE includes 
a “front-end” workstation, “back-end” computers, 
and software. The environment is designed to allow 
scientists and engineers to undertake numerical 
experimentation without programming, interact 
with their numerical experiments via interactive 
graphics, and have total access to the results 
through easy manipulation of a variety of graphical 
output structures. These structures are designed to 
enhance the user’ s ability to obtain quantitative 
information from graphics [Peskin et a!., 1 992] . 

The back-end systems support intensive nu- 
meric and symbolic computations, and this compu- 
tational power is available to the user in a transpar- 
ent manner. Thus, SCENE provides a way for the 
user to access high-performance computation 
without requiring direct involvement in complex 
aspects of computer programming systems. The 
environment is fully object oriented in design so 
that objects such as vectors and triangles in SCENE 
exist just as they do in physical models. This 
permits rapid and accurate prototyping of visualiza- 
tion tools for complex data sets. The environment 
employs a data base that recreates in memory the 
logical connectivity that exists in the grid of the 
data set. Maintaining a direct connection between 
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set is the first priority of this data storage and 
retrieval model [Walther and Peskin, 1993]. 
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1.1 Object Editor 

To create a SCENE visualization tool for 
atmospheric data, the user provides information 
about the data for which a tool is to be constructed. 
The facility for entering this information is called 
the Object Editor, shown in Figure 1 . This mouse- 
driven tool is used to enter the size, orientation, and 
format of the data, grid geometry or unstructured 
connectivity information, and file locations. At this 
point, the user gives each field in the data set a 
descriptive name and identifies it as vector or 
scalar. The user then gives the data description a 
name and saves it to a file. The Object Editor uses 
this information to create a visualization tool whose 
back-end processing is performed on any specified 
host. 

1.2 User Filters 

Often, large scientific computation data sets are 
limited in the number of dependent variables that 
are available from the base computation. For 
example, a large CFD computation may contain 
only the basic velocity field vectors in its output. 

If during post process examination, vorticity 


information is required, additional large-scale 
processing usually is needed. We have developed a 
new type of data structure that allows very flexible 
access to large data sets. By coupling this data 
structure with the equation editor we can offer the 
user the ability to generate interactive filters for 
these large data sets. This reduces the need for 
large-scale recomputation [Peskinetal., 1992], 

User-added filters are specified as vector or 
scalar operations on existing fields. Filters are 
entered as mathematical expressions which are 
parsed and passed to a symbolic solver that under- 
stands the structure of the data. New fields are 
computed from the user-added filters based on 
interpolations of the existing fields across the 
domain. Once computed, the tool treats a field 
generated by a user-added filter as it would the 
original data. It may be visualized graphically, or 
used as the basis of further calculation. The nu- 
meric values of a field generated by a user filter 
may be examined within the tool . 

1.3 Grid Issues 

SCENE supports meshed data in structured and 
unstructured formats. For structured grids, general 
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Figure 1. Object Editor used to create a SCENE visualization tool for a General Circulation Model data set. The boxes on the top and left 
indicate the structure of the datafiles and how they are to be displayed. The fields to be visualizedfrom the datafiles are shown in the center. 
Below these are equations entered as user-added filters. The syntax is that of the MAPLE symbolic computing package. 
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curvilinear coordinates are supported. At every 
point in the structured grid, the directional deriva- 
tives of the mesh are calculated to form a local 
Jacobian of the coordinate transformation. The grid 
metrics are used to specify geometrically proper 
vector operators, such as grad, div, and curl 
[Greenberg, 1978]. Proper formation of the grid 
metrics is a prerequisite for integrating particle 
pathlines. 

For general curvilinear coordinates, the partial 
derivatives are computed at every point in the grid 
using five-point differencing. These define a 3 x 3 
matrix, which is the inverse of the Jacobian of the 
coordinate system. The inversion of this matrix at 
every point is not particularly expensive, because 
the matrix is small and the routine only needs to be 
performed once for a given grid. 

For unstructured meshes, connectivity information 
about the mesh must be supplied. The basic elements 
of the two-dimensional unstructured mesh are tri- 
angles, each of which must contain information about 
its vertices and its neighbors (triangles that border its 
sides). Hi gher order computations are possible only on 
meshes that contain still more connectivity information 
[Froncioni and Peskin, 1993]. 

1.4 Visualization Options 

From the Object Editor, a mouse click will 
launch a visualization tool for the data. All plotting 
options are available on fields generated from user 
filters as well as the stored data. The functionality 
is the same for gridded and unstructured meshes. 

The tool has four menus, which set visualization 
options. Menu one controls file input and output. 

As soon as the tool is created, the user loads the 
specified data in the Object Editor. If the user filter 
is used to calculate additional fields, these may be 
saved to files. Menu two sets the plotting option. 
Available graphical views include contour plots, 
vector plots, particle tracking, wireframe drawing, 
and surface plots. Menu three controls sub view 
options for row plots, column plots, zooms, and 
point probes. Menu four is used to set the field that 
is to be visualized. Figure 2 shows several of the 
available options. 

In a SCENE visualization, the graphical view is 
always linked directly to the underlying data. The 
user can probe the numerical data by clicking the 
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mouse on the view. A window will pop up, which 
displays the numeric value and type (vector or 
scalar) of every field at that point. A subview may 
be created of any portion of a graph by dragging the 
mouse pointer over an area of the view. The 
subview is not merely a magnification of the 
original drawing, but rather a redraw that might 
indicate features too small to be resolved in the 
original view. Zoomed views have the same func- 
tionality as the original graphs, including the data 
probe. The user also has the option to zoom again 
in the subview and to zoom in on any number of 
different regions from the original vie w . 

2. PARTICLE TRACKING 

2.1 Trajectories in Steady and Unsteady Flow 

SCENE has the capability to trace the field 
lines of any vector in the data set. The field lines of 
the velocity vector describe the trajectory of a 
massless particle for steady flow, which defines a 
streamline. For a time-dependent data set, the user 
has the option to trace true pathlines in the un- 
steady flow or streamlines at an instant (Figure 3). 
Pathlines show the time history of single particles 
as they are advected with the flow. If the flow is 
unsteady, particle trajectories may cross them- 
selves, while streamlines never cross. In Figure 4, 
the red pathline and the blue streamlines are traced 
from the same region of the western Pacific Ocean. 
The trajectory of the pathline is similar to one of the 
streamlines, but the unsteady tracer moves errati- 
cally. In steady flow, the streamlines and pathlines 
coincide. 

2.2 Trajectory Options 

The particle-tracing option (path tool ) has 
functions that provide flexibility to the user. For 
example, the starting coordinates of a pathline may 
be specified from a pop up menu, which allows 
highly accurate specification of the location. The 
user may instead start pathlines at a location chosen 
with the mouse. A third option is to specify starting 
locations in a file, which is useful for tracing a set 
of pathlines repeatedly. If starting locations are 
read from a file, the color of each line may be 
specified as well. 
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Figure 2. Graphical interface for SCENE. The primary window (GCMloot, upper rightjshowsa contourplot ofvorlicity. with color key (lower righothalwaschosenfromapop up menu. 
Additional views have been spawned from the subviews menu (third menu from the left in the GCMtool control panel ). These are column (upper left) and row (lower left) plots along the 
lines indicated in the primary window and point probes of the data ( lower center) at the intersection of these lines. 
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Figure 3. Streamlines at the 0.25-kg/m 3 density altitude of the G1SS-GCM for a typical model year. Velocity field lines are traced 
from points with roughly 10 degrees of latitudinal separation. The streamlines demonstrate that the flow is predominantly zonal 
in the middle latitudes, while the tropics are more complex and contain many recirculation zones. 


90 0 


45.0 


0 U 


-45 0 


-90 0 




-180 0 - 90.0 0 0 90 0 190 0 

Figure 4. Pathlines ( red, magenta, and green) and streamlines (cyan) for the 0.25-kg/m 3 density altitude of the GISS-GCM. These tracers 
were released interactively by the user to locate interesting regions of the flow in both the transient and instantaneous domains. 
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The time step for trajectory integration is 
adjustable, and both positive and negative step sizes 
are valid. Pathlines are traced by integrating in 
time from any starting point. The usual method of 
integration is fourth-order Runge Kutta, but lower 
order integration methods have been considered for 
situations where computing resources are limited. 

To use the Runge Kutta method, four interpolations 
into the grid are required for each time step, and 
each interpolation is numerically expensive. As an 
alternative, a second-order, modified Taylor 
method suggested by Westerveld (1989) has been 
incorporated into the path tool for general curvilin- 
ear coordinates. For this method, derivatives of the 
velocity field are precomputed and stored at each 
grid point. 

3. VISUALIZATION OF GENERAL 
CIRCULATION MODELS 

As part of a study comparing numerically 
calculated particle trajectories with balloon obser- 
vations from the TWERLE experiment [Er-El and 
Peskin, 1981], a SCENE visualization environment 
has been created for data generated by the General 
Circulation Model at the Goddard Institute for 
Space Studies (GISS-GCM). The purpose of the 
GCM tool is to explore the transient flow field and 
analyze the motion of many particle trajectories. 

The model uses a grid of 36 lines of longitude by 24 
lines of latitude with nine atmospheric levels. A 
finer resolution is also available with 72 longitudes 
and 46 latitudes. The grid covers the entire surface 
of the Earth, and the nine vertical layers extend 
from the planetary surface to well above the jet 
stream. The Earth is gridded in spherical coordi- 
nates with a constant radius. The available data 
contain 1 2 monthly averages to simulate a typical 
calendar year. Another set contains 3 1 daily 
averages for January 1979. The nine vertical 
coordinates are set at constant levels of pressure 
difference. 

3.1 Data Representation 

Three coordinates are required to specify a 
point on the surface of the Earth in Cartesian 
coordinates (such as length, width, and height from 
the Earth’ s center). If spherical coordinates are 


Visualization Techniques in Space and Atmospheric Sciences 

used instead, only two coordinates (longitude and 
latitude) are necessary. This is a substantial advan- 
tage, but it creates a problem of grid degeneracy at 
the poles. The existence of poles prevents SCENE 
from using the techniques developed for general 
curvilinear coordinates. For this reason, support for 
spherical coordinates is a separate option that is 
built into the GCM visualization tool. This option 
provides the user with the ability to perform vector 
operations that are even more accurate than those 
for curvilinear coordinates because the grid metrics 
are described analytically. To account for the non- 
affine mapping of the Earth onto a rectangle, the 
interpolation is weighted by projected area. This 
correctly accounts for the latitudinal variation of a 
cell’s area. 

In addition to the basic advection equations, 
GCM integrates the energy sources to the flow 
whose effects may be highly localized, while the 
resolution of the model is quite coarse. For this 
reason, first-order interpolation is used wherever 
possible. Higher order interpolation, which sacri- 
fices locality for higher nominal accuracy, may 
produce undesired smoothness in the results. For 
example, GCM ’s differing representations of sea 
and land surfaces generate a steep gradient at the 
coastlines, especially at the mountainous west 
coasts of the American continents. These gradients 
persist into the higher levels of the atmosphere and 
the effects of these gradients are diminished when 
higher order interpolation is used. 

3.2 Data Reduction 

The volume of data taken from GCM was quite 
large — 46 latitudes x 72 longitudes x 9 levels x 3 1 
days. The two vector field’ s position and velocity 
were stored, along with two thermodynamic scalars 
(pressure and potential temperature). In single 
precision, this is 29 megabytes of data. While 
SCENE is able to handle large data sets, only one 
plane of constant density altitude is of interest for 
comparison with the TWERLE experiment. Also, 
the GCM data are generated on surfaces of constant 
relative pressure difference (sigma levels), so the 
data are not directly comparable with the experi- 
mental results. To trace Lagrangian particles that 
model the balloons accurately, isosurfaces of 
density were formed by interpolating between the 



Chapter III: Lower Atmospheric and Earth Science Applications 

pressure surfaces. One constant density surface 
was formed for each day, thus reducing the dimen- 
sionality of the problem to three: latitude, longi- 
tude, and time. The storage is therefore (2 x 3d 
vectors + 2 scalars)*46 x 72 x 3 1 *(4 bytes per 
float) = 3,285,504 bytes, which is managed by 
SCENE running on a single workstation. 

4. QUALITATIVE DESCRIPTIONS OF 
ATMOSPHERIC FLOW 

The global atmospheric flow field contains 
large-scale structures that are analyzed with the 
GCM tool. The primary means of describing the 
instantaneous flow field is streamline tracing. 

Figure 3 shows a streamline pattern created with the 
path tool at one time step. The expected pattern of 
smooth flow at the middle latitudes and recircula- 
tion in the tropics is apparent. Terminating stream- 
lines indicate flow sources and sinks in the two- 
dimensional flow field. A streamline that spirals 
into a stagnation point may indicate a true sink in 
the compressible flow field. However, the flow 
field of Figure 4 is at a constant density, so the 
streamlines terminate where the flow is predomi- 
nantly vertical. 

4.1 User-Added Filters on GCM Data 

The vertical component of the GCM velocity 
field is a diagnostic variable that was not included 
in the data set. The GCM tool was therefore used 
to generate information about the flow in this 
direction. For incompressible flow, conservation of 
mass indicates that the vertical flow is found by 
integrating the continuity equation over the thick- 
ness of the atmospheric layer. The term inside the 
integral is the divergence of the two-dimensional 
flow field, the equation for which is entered as a 
user filter in Figure 1 . The full solution for the 
vertical velocity requires a constant of integration at 
every grid cell, but the information from GCM 
required to calculate this constant was not included 
in the available data set. 

4.2 Transient Flow Description 

The transient properties of the flow field are 
less easily analyzed, partly because of the higher 
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dimensionality of the data. While it is not difficult 
to create a contour plot with time as one coordinate 
direction, the interpretation of such a picture is not 
always apparent. Also, streamlines are informative, 
but they do not give a complete description of a 
transient flow. The primary means of exploring 
time-dependent properties of the flow is with 
pathlines. The interactive nature of the tool is 
important when tracing pathlines; the static picture 
of Figure 4 is not ideal for portraying a time- 
varying simulation. 

Interesting transient behavior is shown by the 
green pathline west of Central America. A mass- 
less particle released at the first time step recircu- 
lates in a very small region for most of the simula- 
tion before moving off to the northwest at the end 
of the month. The figure demonstrates a primary 
distinction between streamlines and pathlines— the 
streamlines are isolines of a vector field and cannot 
cross themselves or each other in the two-dimen- 
sional field. Pathlines have no such restrictions, as 
each line is the projection of two-dimensional plus 
time trajectory onto the plane. Streamlines are 
contrasted with pathlines in the figure by starting 
light blue streamlines and one pink pathline in the 
same region of the equatorial Pacific. The pathline 
moves erratically but follows the general flow 
pattern indicated by one streamline. The other 
streamline falls into a sink in Indonesia, which 
exists in the July 1 mean flow field. The red 
pathline indicates that the flow in the middle 
latitudes is smooth and predominantly longitudinal 
for the entire month. 

4.3 Critical Point Analysis 

The streamlines shown in Figure 3 were traced 
from starting positions separated by five degrees of 
latitude. The starting points for these were chosen 
by the user arbitrarily, and the resulting picture 
misses important details. In the tropics, more 
streamlines are required to resolve smaller scale 
structures, while many streamlines in the smooth 
flow of the mid-latitudes are redundant. The 
identification of flow topology with greater 
precision is possible by examining the critical 
points of the flow, as demonstrated by Dickinson 
(1991) and Froncioni and Peskin (1993). The 
techniques of critical point analysis are applied in 
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the GCM tool only as a useful example. A complete 
implementation requires the construction of a 
connectivity diagram that identifies all critical 
points in the data and the streamlines that connect 
them. 

The critical point equation, applied as a user 
filter in Figure 1 , defines a vector field that is 
meaningful only near a saddle point. In this region, 
the field points to the saddle so that a field tracer 
started in this region will move directly to the 
saddle point. A close look at the streamline pattern 
of Figure 3 indicates the existence of a saddle point 
in the tropical Atlantic Ocean. Using the mouse, a 
location off the African coast is selected, and a line 
of the newly calculated critical field is traced, 
which is shown in red in Figure 5. The field line 
terminates at the saddle point to the west. To verify 
that this is the proper location of the saddle point, 
two streamlines are shown in green. The lines are 
initially close to each other and are well correlated 
as they travel eastward over South America. 
However, the streamlines diverge immediately as 
they pass on opposite sides of the saddle point. 


5. CONCLUSION 

We have presented the SCENE interactive data 
exploration environment and demonstrated its 
applicability for the manipulation of large 
multidimensional data sets. The use of an object- 
oriented program structure makes it possible to 
create extensible, customized visualization tools. 
This flexibility is achieved without any 
programming required by the user. 

This is demonstrated with a visualization tool 
for the exploration of data from numerical general 
circulation models. The simulated general 
circulation can be visualized in several ways. At a 
specified time, the instantaneous flow field can be 
explored with streamlines in combination with 
critical point analysis. Derived fields can be 
generated and displayed interactively with user- 
added data filters. The graphic displays remain 
linked with the underlying data, which enables 
examination of the data by clicking on the picture. 
The time-dependent flow field was investigated 
qualitatively with pathlines. 



Figure 5. Saddle point locator (red) and streamlines (green) for the 0.25-kg/m 3 density altitude of the GISS-GCM, for a typical 
daily mean flow field. Two streamlines, initially similar, diverge as they pass on either side of the saddle. 
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SCENE is available on the Internet via anonymous 

FTPtocesl.rutgers.edu. A multimedia demonstration 

will soon be accessible via WWW. For further 

information or details about obtaining and configuring 

S CENE, contact rrosen@caip.rutgers.edu. 
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Within 12 months, a 5-year space-based research inves- 
tigation with an estimated daily data volume of 10 to 15 
gigabytes will be launched. Our instrument/analysis team 
will analyze 2 to 8 gigabytes per day from this mission. 
Most of these data will be spatial and multispectral, 
collected from nine sensors covering the UV/Visible/NIR 
spectrum. The volume and diversity of these data and the 
nature of its analysis require a very robust reduction and 
management system. This paper is a summary of the 
system 's requirements and a high-level description of a 
solution. The paper is intended as a case study of the 
problems and potential solutionsfaced by the new genera- 
tion of Earth observation data support systems. 


1. INTRODUCTION 

During the next decade, a new generation of 
Earth-observing platforms will be acquiring large 
and diverse data sets. To analyze these data effec- 
tively, new data management and reduction tech- 
niques must be developed. We present here a case 
study of the data reduction, management, and 
analysis system currently under development for 
the Ultraviolet-Visible Imaging and Spectrographic 
Imaging (UVISI) instrument aboard the Midcourse 
Space Experiment (MSX). This instrument will be 
used for Earth and celestial observations and has an 
estimated accumulated data volume of 1 0 terabytes 
over the 5-year mission. The following sections 
include a description of the platform and the 
instrument to be supported and the data set itself, a 
summary of the data analysis requirements, and a 
discussion on the architecture of the system under 
development. 

2. THE DATA SOURCE 

The MSX spacecraft will be placed in a polar 
orbit at an approximate altitude of 900 km. The 
spacecraft carries several optical remote sensing 
instruments, including the UVISI instrument, which 
is our payload of interest. Aside from small line-of- 
sight modifications affected by scan mirrors, ail of 
the optical sensor pointing is determined by the 
attitude of the spacecraft. All of the optical instru- 
ments share a nominal optical axis, which can be 
pointed in an arbitrary manner. 

Data from the satellite are downlinked at a 
single ground site at a rate of 25 megabits per 
second. The total data volume is therefore limited 
by contact time with the ground station. It is 
estimated that the average daily data output will be 
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10 to 15 gigabytes. Onboard tape recorders can 
record data at both 25 and 5 megabits per second. In 
the 5-megabit mode, data can be collected for more 
than 5 hours per day. The daily data rate of UVISI, 
the payload of interest, is determined by mission 
priorities. UVISI shares the downlink and other 
resources with an infrared instrument with a nomi- 
nal 1 8-month lifetime. During this period, the 
infrared instrument will have priority on the link, 
and the UVISI data stream is estimated at 2 to 5 
gigaby tes/day. For the remaining 42 months of the 
60-month mission, the data rate for UVISI is 
expected to regularly exceed 8 gigabytes/day. 

UVISI is composed of nine sensors, consisting 
of four spatial imagers and five spectrographic 
imagers (SPIMs) [Carbary, 1992], Each of these 
sensors has its own optics and intensified charge 
coupled device (ICCD) detector. The intrascene 
dynamic range for these sensors is 12 bits (4096). 
The interscene dynamic range is determined by the 
intensifier gain and is approximately 1 06. The 
framing rate for all of the sensors is either 2 or 4 
hertz. The spatial imagers provide a variety of 
angular coverages and spectral sensitivities. Filter 
wheels can be used to select spectral bands. The 
spatial imagers do not employ scan mirrors, and are 
thus completely dependent on spacecraft attitude 
for pointing. The SPIMs employ dispersive optics 
to create spectral images, which contain both 
spectral and spatial information. Each SPIM can 
acquire up to 40 spectra of 272 channels simulta- 
neously. The field of view of the SPIMs is much 
more narrow than that of the imagers. However, the 
field of regard is increased by the use of scan 
mirrors. The SPIMs provide continuous spectral 
coverage from the far ultraviolet (FUV) to the near 
infrared (NIR) ( 100 to 900 nanometers). The large 
dynamic range, spectral coverage, and variety of 
instrument modes provide the investigators with the 
means to explore a great number of different 
phenomena. 

3. THE DATA SET 

The large data rate and the flexibility of both 
satellite pointing and observation modes make 
UVISI a very challenging data set to manage and 
analyze. Because of the size and complexity of the 
data, recalibration and data location are expensive 


tasks. For this reason, several different types and 
levels of data products are planned. 

The primary archive of instrument data will be 
composed of minimally processed raw data (termed 
level 1 a data) . The minimal processing resolves 
overlaps, separates the data by instrument, and 
places the data in the proper time order. The data 
are not stored in a calibrated form for two reasons. 
The primary reason is the expense of recalibrating 
large portions of the data set. With the present 
scheme, only the fraction of the data that is of 
interest to the investigators needs to be calibrated. 
Furthermore, the process of calibration generally 
expands the size of the original raw data set, by 
converting integer values to the floating point and 
specifying the uncertainties of the calibrated data. It 
has been estimated that the UVISI data will expand 
by a factor of 8 when calibrated. The storage of 
calibrated data is therefore much more expensive. 

To be useful, the data must be calibrated. For 
UVISI, a standard set of calibration routines and 
data is supplied to our facility by the UVISI instru- 
ment team [MSX, 1991]. All calibration changes 
are then reflected in the software or calibration 
data. Thus, only the most important data need to be 
recalibrated. The output from the calibration 
software is termed level 2 data. These data contain 
radiometrically calibrated images but do not contain 
processed pointing information. In addition to the 
calibrated data, level 2 files contain indicators of 
instrument configuration and data health. 

Data location also is a serious challenge to the 
analysis of the UVISI data. To facilitate the 
location of useful data, summary files are produced 
for each data set. These summary files contain 
sensor configuration and data quality information, 
as well as calibrated radiometric summaries of the 
data taken. For the spatial imagers, the radiometric 
summaries include mean pixel values and overruns 
for different portions of the focal plane. For the 
SPIMs, the total intensity measured within a set of 
given spectral ranges is reported. These radiometric 
values are much smaller than the images themselves 
and can be used for rapid searches. All of the 
summary products are produced “upstream” of our 
facility, and the details of their creation will not be 
described here. 

Knowledge of the radiometrics is of limited 
utility without sensor pointing data. Corrected 
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ephemeris and attitude information for the MSX 
spacecraft is supplied by a processing center 
dedicated to this task. Our facility is provided with 
the spacecraft position and the attitude (in quater- 
nion form) of the nominal optical axis of all the 
instruments and with sensor offsets from this axis. 
We can thus determine the attitude of a fiducial 
point for each of the UVISI sensors. It is our task to 
convert this set of ephemeris attitude data into 
useful pointing quantities (such as spacecraft 
latitude, tangent point altitude, etc.) for either the 
center of each imager or spatial pixel. It is also our 
task to merge the pointing data with the radiometri- 
cally calibrated data. 

The products outlined above will provide most 
of the information necessary for our data reduction, 
management, and analysis. Low rate information, 
such as planning data and spacecraft housekeeping, 
will also be used but will not be central to our 
processing. 

4. ANALYSIS REQUIREMENTS 

The flexibility of UVISI allows a large number 
of phenomena to be studied. The investigators at 
our facility have defined 1 5 different experiment 
plans, ranging from auroral studies to lightning 
measurements. These plans define dedicated 
observations by UVISI. In addition to these dedi- 
cated observations, other investigation teams will 
use UVISI for celestial observation, for contamina- 
tion studies, and as a complement to the infrared 
observations. Data taken during these other obser- 
vations also may be of use to our team. Further- 
more, several analysis tasks have already been 
defined that combine data from planned observa- 
tions. These are essentially plans to re-use the data 
taken during the dedicated observations for alter- 
nate purposes. It is expected that the definition of 
these alternate analysis plans will continue through- 
out the mission, thus requiring us to be able to 
identify data that are useful to such tasks. We have 
thus separated the UVISI data analysis into three 
types of processing tasks : 

• Planned experiment processing 

• Planned re-use processing 

• Historical re-use processing 

The principle use of this classification is data 
identification. Data from the planned experiments are 


selected on the basis of the mission’ s operations 
schedule. The planned and historical re-use data are 
identified by instrument pointing, configuration, and 
other metadata describing the observations. In the case 
of planned re-use data, the data are selected from the 
incoming data stream. The historical re-use data 
involve extracting data from the UVISI archive. 

Once the data for a particular analysis task have 
been identified, they must be reduced or otherwi se 
processed according to the requirements of that task. 
Most of the tasks have differentprocessing require- 
ments. After the data have been prepared for analysis, 
they must become accessible to the investigators i n an 
interactive mode. Data reduction and analysis thus fall 
naturally into two categories: ( 1 ) pipeline data produc- 
tion and (2) interactive analysis. The investigators have 
been encouraged to place any analysis processes that 
can run in an unattended, automated mode in the 
pipeline for that task. This will allow the investigators 
to concentrate on the less stable elements of their 
analysis. It is expected that as the mission progresses 
and the analysis algorithms and the instrument data 
become better understood, the processes that had been 
interactive will migrate to pipeline processing. After 
the data have been through the pipeline, it will be 
available for interactive analysis. A schematic of the 
data flow is illustrated in Figure 1 . 



Figure 1. Data separation and processing. 


5. ARCHITECTURE 

The high-level architecture of our system is 
presented in Figure 2. The four basic subsystems 
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are: automated data production, catalog, 
investigator’s interface, and the archiver. When 
data are received, all of the incoming data are made 
available to the automated data production sub- 
system. The summary files and attitude information 
are passed in parallel to the catalog subsystem. The 
catalog subsystem builds a unified description of 
the sensor data from the files that it receives and 
passes this information back to automated data 
production. Based on this unified summary, the data 
production system decides how to process the 
sensor data. The data are processed, and the output 
of the processing is placed in working storage. A 
summary of these products is sent to the catalog 
subsystem. The data have a limited lifetime in 
working storage and will be placed into the deep 
archive when it is not being used. The archiver 
subsystem manages data transfers between working 
storage and the deep archive. The archiver may also 
place data in the historical sensor data area, which 
can be used as an alternate input to automated data 
production. The investigator’ s interface is an end- 
user graphical user interface to the data and a set of 
analysis tools. It inputs data directly from working 
storage; historical analysis can be initiated by 
searching the catalog summaries for the data of 
interest and requesting their promotion to working 
storage from the archiver. Any analysis products 
that are deemed suitable for permanent storage can 
be placed into working storage for subsequent 


archival (a summary of these data must be placed in 
the catalog for future access). 

The automated data production subsystem is 
based on the individual analysis tasks. Each task is 
associated with a processing “box.” These boxes 
are constructed based on the requirements of their 
respective tasks. During the construction of the 
boxes, we have built a library of common data 
manipulation tools, which are also shared with the 
investigator’ s interface. Each box is designed to run 
independently and in an unattended mode. The 
subsystem also contains a “box integration” ele- 
ment, which selects data for the boxes, schedules 
their execution, and organizes their output in 
working storage. In effect, the box integration 
element acts as a shell that determines how the 
boxes run. The operational scenario of the auto- 
mated data production subsystem is as follows. 
Upon receipt of new data, box integration queries 
the catalog to determine which boxes, if any, should 
be applied to the new data. The sensor data are then 
sent to working storage for archival. Box integra- 
tion then organizes the data appropriate to each 
selected box and then executes that box. The boxes 
reduce the data, reporting their progress in a status 
log, managed by box integration. Upon completion 
of box processing, the output products are then 
organized appropriately in working storage. A 
summary of these products is also produced by box 
integration and is posted to the catalog. Thus the 
data have been reduced, described, and prepared for 
archive. 

The subsystem catalog contains a description of 
all the products that are available at our facility. 

The relationships among the products are also 
described by the catalog, so that a unified descrip- 
tion of the facility data holdings is always available. 
Summaries of the sensor data and derived products 
are constructed from the summary files and the 
attitude information. Note that the attitude informa- 
tion is converted into useful pointing quantities as it 
is loaded. The use of the summaries not only 
provides an excellent description of the data hold- 
ings, but also concisely describes the mission itself. 
Thus, the catalog can provide a history of the 
mission as well as assist data location. We have 
separated the catalog into two parts: the staging 
catalog and the historical catalog. The staging 
catalog contains only the newest data (these data 
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have a lifetime of about 7 days). The historical 
catalog contains data summaries from the 
entire mission. The staging catalog is used by the 
automated data production subsystem when it is 
processing new data. It also serves as a quarantine 
area and transaction buffer to protect the historical 
catalog from badly formatted input data and to 
allow for more efficient population of the larger 
catalog. The historical catalog will allow the users 
to survey the entire data holdings without having to 
access the data in the deep archive. The estimated 
size of the deep archive is 10 to 12 terabytes, 
whereas the catalog will only require about 10 to 20 
gigabytes, and can remain online throughout the 
mission. The historical catalog is used by the 
automated data production system for selection of 
historical sensor data for re-use processing. It can 
also be used by the investigators to determine 
whether the current data holdings can usefully 
support a perspective analysis task. The historical 
catalog adds much value to the holdings of our 
facility, in that a small amount of useful data can be 
located and extracted from the large archive. 
Without the catalog, much of the data would rot on 
the shelf. 

The investigator’ s interface subsystem is a 
graphical user interface that provides access to all 
levels of the data, as well as the means to analyze 
them. Various data analysis tools, developed in 
concert with the box processing, can be used 
interactively or be assembled into batch processes. 
The library of tools can be extended by the users 
and may include specialized tools necessary for 
particular analysis tasks. The interface includes 
several functions for locating, reading, and writing 
data in working storage. Final products constructed 
during interactive analysis can be registered in the 
catalog and stored. Intermediate and working 
products can be placed in a user’s own work space. 
The interface will include access to several graphi- 
cal display and analysis packages, such as IDL, 
AVS, and Khoros. This will allow users to analyze 
their data with familiar tools without having to 
worry about writing tedious input/output routines. 

The archiver subsystem moves data to and from 
the deep archive. The bulk of the facility’s data 
holdings will be stored in the deep archive. The 
current primary storage medium for the deep 
archive is tape storage. The archiver is a combina- 


tion of the automated process of tracking data files 
and their location on physical media and the manual 
process of walking between the shelf and the tape 
drive. The archiver process maintains a data base of 
the contents of the deep archive and the working 
storage area. It also is responsible for migrating 
data to and from working storage. Data will reside 
in working storage only as long as they are actively 
in use. After a given time of inactivity, if the data 
have not already been placed in the deep archive, 
they are archived and deleted from working storage, 
Data that already reside in the deep archive are 
simply deleted from working storage. Reliable 
performance of the archiver is crucial to the success 
of the project, so every effort has been made to 
keep its design as simple as possible. 

6. SUMMARY AND DISCUSSION 

We have presented a system architecture for the 
reduction, analysis, and management of large 
volumes of Earth observation data. This architec- 
ture is sufficiently flexible to allow for changes in 
instrument operation and calibration, as well as 
changes or additions to the reduction and analysis 
algorithms applied to the data. Our primary goal in 
the construction of the UVISI processing system is 
to allow the investigators to concentrate on the 
content of the data rather than the details of the data 
processing system. 

By far, the most challenging elements of 
manipulating a data set of this size (approximately 
10 terabytes) are data location and flexible process- 
ing. Unless robust data location methods are 
implemented, the task of finding data useful for a 
particular analysis becomes impractical. Flexibility 
of processing is vital to research investigations such 
as these. More often than not, data processing 
systems have to be significantly modified because 
of unforeseen circumstances after data acquisition 
had begun. The utility of the data reduction, 
management, and analysis system cannot be over- 
emphasized. A properly implemented system allows 
the investigators to maximize the utility of the data 
set and to apply previously acquired data to new 
problems. Without this capability, newly formu- 
lated problems will require new missions, which in 
turn will acquire large volumes of only marginally 
useful data. 
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Several problems posed by the rapidly growing vol- 
ume of geophysical data are described, and a selected 
set of existing solutions to these problems is outlined. 
A recently developed desktop software tool called the 
Grid Analysis and Display System (GrADS) is pre- 
sented. The GrADS’ user interface is a natural exten- 
sion of the standard procedures scientists apply to 
their geophysical data analysis problems. The basic 
GrADS operations have defaults that naturally map to 
data analysis actions, and there is a programmable 
interface for customizing data access and manipula- 
tion. The fundamental concept of the GrADS’ dimen- 
sion environment, which defines both the space in 
which the geophysical data reside and the “slice ” of 
data which is being analyzed at a given time, is ex- 
pressed. The GrADS ’ data storage and access model is 
described. An argument is made in favor ofdescrib- 
able dataformats rather than standard data formats. 
The manner in which GrADS users may perform op- 
erations on their data and display the results is also 
described. It is argued that two-dimensional graphics 
provides a powerful quantitative data analysis 
tool whose value is underestimated in the current 
development environment which emphasizes three- 
dimensional structure modeling. 


1. INTRODUCTION 

The earth scientist today faces an exponentially 
increasing volume of observational and model 
output data. The fraction of that data that can be 
sampled, organized, archived, and intelligently 
analyzed grows constantly smaller as the improve- 
ment of computer hardware and software fails to 
keep pace with the growing volume of data. 

While it is true that computer hardware devel- 
opment has advanced rapidly, there are a number of 
problems with the software that has been and is 
being developed for the high-end workstation 
platforms. A large number of data analysis and 
visualization programs are available in the form of 
commercial off-the-shelf software (COTS). Many 
of these programs hold great promise for solving 
the data volume problem, but they have a number 
of disadvantages, which make them inaccessible to 
a large portion of the earth science community. 
Many COTS products are too expensive or too 
difficult to learn for many scientists who have 
limited funding or time budgets. Several COTS 
programs support only their own internal data 
format or a limited number of “standard” or “com- 
mon” data formats so that a given scientist’ s data 
may not be handled by these programs. Most earth 
scientists have found that 80386- or 80486-based 
computers are adequate for their work and are 
consistent with their equipment budgets, but COTS 
developed for RISC/UNIX computers or for special 
purpose hardware (such as three-dimensional 
graphics acceleration boards), will not ran on these 
machines. Also, in our opinion, there has been an 
overemphasis on three-dimensional structure 
modeling and object rendering, which, admittedly, 
provides a useful tool for intuitive or conceptual 
exploratory data analysis but is seriously lacking in 
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the kind of quantitative data analysis scientists 
require on their desktops that is provided by two- 
dimensional graphics. Finally, it is our opinion that 
COTS products do not offer all that is needed to all 
earth scientists because they typically have a steep 
learning curve and require sophisticated program- 
ming (such as the construction of modules and 
module networks in the case of data flow environ- 
ment software). 

Existing visualization software is frequently 
less useful to earth scientists than it could be 
because it is designed and written with a singular 
emphasis on computer graphics or an “ergonomic” 
user interface that purports to be intuitive by 
allowing users to navigate through nested menus or 
linked lists of point-and-click mouse button events. 
The result is that insufficient design attention is 
paid to the scientific problems at hand. In 
particular, existing software does not adequately 
link together the processes of data analysis and 
visualization. In cases where analysis is 
emphasized, scientists must export their results to 
other platforms or software packages to depict them 
graphically. When visualization is emphasized in a 
given program, there is no capability for 
quantitative analysis or comparison among results, 
and the linkage between the data and the 
geophysical context (map background, earth- 
registered coordinates, etc.) is missing. Although 
we make these assertions without proof, there are 
some articles and scientist surveys that support 
them (such as Doswell, 1992; Botts, 1993, personal 
communication). 

Our experience is that earth scientists are only 
briefly enamored of the graphical user interface 
(GUI) to a particular data analysis and visualization 
program when they first begin using the program. 
After becoming familiar with the program, they 
find that the multitude of menus, button clicks, and 
dialog boxes slow down their productivity. Many 
earth scientists desire graphical user interfaces to 
their own data — that is, they want to have a graphi- 
cal means of accessing, manipulating, and display- 
ing their own data that is intuitively familiar and 
consistent with the way in which they normally 
approach their data. Thus, a program GUI, which 
provides users a graphical means of accessing all 
the myriad features of a visualization program, 
often gets in the way, but a data GUI, which pro- 


vides a simple means for users to work with their 
own data, is highly desirable. 

Finally, many earth scientists have taken 
exception to existing programs because they are 
insufficiently extensible or customizable and 
because they divide the tasks of data analysis and 
visualization rather than linking them more closely 
together. Earth scientists want to have the capability 
to add functionality, add their own analysis 
techniques, and customize existing programs to 
their data. They have traditionally done this with 
FORTRAN and the programmable (batch) graphics 
software packages (such as NCAR Graphics). Earth 
scientists also have integrated data analysis with 
visualization in the past by writing FORTRAN 
(batch) programs that include both calculations 
involving basic data sets and calls to graphics 
libraries to display the results of those calculations. 
They want to be able to achieve the same level of 
integration interactively so that the data analysis 
process can be streamlined and made more 
intuitive. 

To provide a means for conducting geophysical 
data analysis interactively, we have designed and 
implemented the Grid Analysis and Display System 
(GrADS). This program integrates the functions of 
data access, data manipulation, and data visualiza- 
tion, providing users a single interface to their own 
data [Kinter and Doty, 1993]. The GrADS program 
has been designed to minimizing dependence on 
expensive computer hardware components such as 
system memory and graphics boards. GrADS runs 
well on most UNIX platforms (Sun Space, SGI, 
IBM, HP, DEC, etc.), as well as 80386(87)/80486/ 
Pentium-based personal computers. We have 
introduced this program to an international commu- 
nity of geophysicists who have found it quite 
suitable for their interactive data analysis require- 
ments. The following sections briefly describe the 
GrADS solutions to the problems outlined above. 

2. USER INTERFACE 

The GrADS program implements a command 
line interface in which the operations of data 
access, data manipulation, and data display can be 
performed by entering commands at a program 
prompt. To control data access, the user enters 
commands that describe the spatial and temporal 
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subsets of interest. To manipulate the data, the user 
enters expressions that specify the operations to be 
performed on the data. To control the display of the 
data, the user has a rich set of commands available 
to specify display types and attributes. 

The GrADS user typically interacts with the 
program by first entering commands to open one or 
more data sets, then entering commands to set the 
desired dimension and graphics environments (or 
accepting the defaults provided by GrADS). Once 
this is done, the user may issue the di splay 
command with an expression as the operand of the 
command. This single command results in data 
being accessed, operated on, and displayed. When 
the user views this display and, as a result, wants to 
see something else, the user can simply enter 
another display command. If this is done without 
an intervening clear command, the new display is 
overlaid on the first. Judicious settings of the 
display environment used with overlaid displays 
can produce striking graphics with a high degree of 
quantitative information content (see Figure 1 ). 

The GrADS scripting language (GSL) provides 
a programmable interface to the GrADS package 
[Doty and Kinter, 1993]. GSL is implemented as an 
interpreted language with a full set of operators and 
flow control capabilities. Terminal and file input/ 
output is supported. The interface between GSL and 
the rest of GrADS provides a rich querying capabil- 
ity for data values, data attributes, the dimension 
environment, and the graphics environment. Users 
can develop scripts (using a standard text editor) to 
issue sequences of GrADS commands under 
program control. While executing the GrADS 
program, users can run the scripts they have created 
to allow a quasi-batch mode of operation (including 
a UNIX command line option to allow access to the 
shell) or to realize more complex or sophisticated 
data analysis and display capabilities. 

The GSL incorporates a capability for users to 
develop a simple GUI to their own data. The 
GrADS command language includes commands for 
drawing simple widgets on the screen, such as 
buttons or sliders. Once the screen has been 
configured with combinations of widgets and 
graphics, a GSL script can query the location of the 
next mouse point-and-click. This location is 
returned to the script either as a widget number and 
associated values (if the click was on a widget) or 


with coordinate values (if the click was on a 
graphic). When the user clicks on a widget, some 
appropriate action can be taken. When the user 
clicks inside a graphic, such as a contour plot, the 
coordinates can be transformed to world or grid 
coordinates and to some action taken, such as 
zooming or querying data. These simple capabilities 
provide for a wide variety of data GUI functions 
that users can design and implement in a very short 
time. 

Several users have employed these basic 
building blocks to develop highly sophisticated 
graphical interfaces to their data. Examples of 
scripts that GrADS users have written include: 

• A script that provides a point-and-click 
interface for viewing real-time surface 
airway meteorological data. The script 
allows the meteorologist to select the plot 
to be displayed via a menu of buttons. Each 
plot is a fairly complex overlay of several 
graphics displaying two or more variables 
in different ways, so that their associations 
are visually obvious and quantitatively 
determined. A click on the displayed 
graphic results in a zoom centered on that 
location or, in the case of a nested zoom, 
results in the display of a time series of the 
selected variable at the nearest station. 

• A script that provides a point-and-click 
interface for viewing sea surface tempera- 
ture (SST) over the Pacific Ocean. The 
script begins with a shaded contour display 
of a longitude(abscissa)-time(ordinate) 
section of SST with the selected latitude 
fixed. After pointing and clicking twice on 
the shaded contour plot, an animation is 
displayed of latitude-longitude contour 
plots, animated through the time period 
bracketed by the two clicks. 

• A script that provides a point-and-click 
interface to the NOAA/NMC Nested Grid 
Model output. The script displays a menu 
of buttons that allows variables to be 
selected and plots for desired regions to be 
displayed. The menu also provides a time 
bar for selecting a specific time or a range 
of times for animation. The user may also 
display multiple variables by overlaying 
different graphics. 
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These scripts, and many others, have been 
developed by users to give them an interface to 
their data, rather than an interface to the GrADS 
program or to the graphics. We plan to support 
further development for such data GUIs. 

The GrADS help facility is provided through a 
set of scripts that allow the user to review various 
commands and options via apoint-and-click 
interface. Examples that demonstrate the use of 
various commands and options can be displayed 
graphically on the screen. In addition, a plain text 
manual describing the full set of GrADS commands 
and options is provided. 

3. THE DIMENSION ENVIRONMENT 

The concept of the dimension environment is 
fundamental to the GrADS program. The user de- 
scribes a space and time subset of interest. This is 
specified as a combination of fixed and varying 
dimensions. The number of varying dimensions 
determines whether the subset is one-, two-, or three- 
dimensional. To specify a fixed dimension, the user 
would enter a single value for the dimension. To set a 
varying dimension, two values would be entered, 
indicating the desired range for that dimension. 

The dimension values may be specified in 
either world coordinates or in coordinates specific 
to a particular data set (i.e., file coordinates). When 
file coordinates are used, they are converted to 
world coordinates using the transformation associ- 
ated with a particular file, referred to as the default 
file. Even if we specify the dimensions in terms of 
file coordinates, GrADS maintains the dimension 
environment in terms of the world coordinates. This 
world view of the data is then translated back to file 
coordinates when data are actually accessed. 

The dimension environment is a concept which 
is independent of the data. When we set the dimen- 
sion environment, we are, in effect, describing a 
viewport into the four-dimensional data world. We 
do not need to worry about where those data are 
located in a particular file. For example, if we set 
the time at January 1, 1993, and open several files, 
we may find that January 1, 1993, is the first record 
in the first file, the 366th record in the second file, 
and the 32nd record in the third file. When data are 
accessed from any particular file, the January 1 , 
1993, record will be accessed, even though the user 


has not explicitly referred to any particular physical 
location of the data in any of the files. 

The spatial view of the data is also handled in a 
file-and-data independent fashion. For example, 
suppose that two files have been opened. The first 
is global in extent and contains a grid that is 1 ° by 
1 0 in resolution. The second file has a 0.2° grid and 
covers only North America. To set the spatial 
viewport to the quadrant of the globe that includes 
North America, the following commands are 
entered: 

set Ion -180 0 
set lat 0 90 

When data are accessed from either file, we will 
get the data that cover that quadrant. In the case of 
the first file, those data are on a 1 ° x 1 ° grid, and 
that grid is fully populated. In the case of the 
second file, the data are on a finer grid, and the 
areas in the quadrant outside North America (such 
as the central Atlantic Ocean) are undefined, 
because those areas do not exist in the second data 
file. The undefined areas of the grid are automati- 
cally filled with missing data values. If graphics 
from both grids are displayed, they are correctly 
overlaid with missing values treated appropriately. 

It is important to repeat that the user need not 
be concerned about the physical storage particulars 
of the data. Data can be easily and quickly accessed 
and displayed from the perspective of the dimen- 
sion environment, which provides a consistent 
geophysical view of the data that is integrated 
throughout the software. 

4. DATA ACCESS AND FORMAT 

The GrADS program is structured to access 
data from a file based on the current geophysical 
viewport, described by the dimension environment. 
Transparently to the user, GrADs translates world 
coordinates into file coordinates, computes the 
physical location of the data in the file, and ac- 
cesses the desired subset of the data. Because the 
actual file structure and data format are decoupled 
from the user view of the data, any file structure or 
data format can be handled. 

The particulars of a data file are described to 
GrADS via a data description file. This is a flat text 
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file that uses blank-delimited keywords to describe 
a particular data file. In general, the description file 
describes two aspects of the data. The first aspect is 
the orientation of the data in space and time. This 
provides for translating the user’s view of the data 
into the physical storage organization of the data in 
the file. The second aspect of the file that must be 
described is the specific format of the data. This 
includes such things as byte ordering, record 
format, file headers, and so on. The data description 
file also contains the missing data value for the file. 
When encountered, this missing data value is 
assumed to represent data that is unavailable. This 
value is used throughout the entire GrADS pro- 
gram, so that missing data are handled properly 
during data operations and display. 

The rationale for the data description file is that 
a given format must be describable to be useful. In 
this context, a describable file format is one whose 
characteristics can be written down in a simple finite 
form, such that an individual element of the file can 
be mapped into its spatial, temporal, and scientific 
domain (such as variable type and units). The 
GrADS program has been designed so that, given a 
data set with a describable file format, a GrADS 
description file paradigm can be implemented to 
describe the data format in the GrADS framework. 
The implementation of support for a describable 
format within GrADS is straightforward. 

Currently, GrADS supports two gridded file 
formats, with a variety of minor variations allowed 
within the supported formats. The first format 
supported is the native GrADS format. This format 
is a simple IEEE floating point binary format, with 
the data elements ordered in a specified way. This 
format is designed so that it can be created, and later 
read and/or modified, by a simple FORTRAN or C 
program. The files are portable among all UNIX 
systems (including Cray UNICOS) and PCs. Minor 
variations to the native GrADS format allow the byte 
ordering of the IEEE words to be specified, allow 
the ordering in the Y and Z dimensions to be speci- 
fied, and allow for file and record headers. The data 
may be written using direct access (stream) I/O, 
convenient for C programs, or using sequential I/O, 
convenient for FORTRAN programs. 

GrADS also supports the GRIB records format 
established as a standard by the World Meteorologi- 
cal Organization. The GRIB records format packs 


grids into an arbitrary number of bits per grid 
element and contains a specified record header that 
describes the data. In the case of GRIB, the data 
format is well described, but the translation be- 
tween the ordering of data in a file and the orienta- 
tion of the data in space and time cannot be prede- 
termined. Thus, a GrADS utility is executed that 
creates an index file before GrADS is used to 
analyze and display the data in that file. This file is 
used by GrADS to quickly determine the location 
of a particular item of data in the data file, once the 
user request has been made. 

We feel that the concept of indexing the data, to 
allow efficient and user transparent access to 
randomly organized data, is very important and 
powerful. In the case of GRIB, it allows the GRIB 
records to be ordered in any way in the file, and it 
allows partitioned records to be handled completely 
transparently to the user. Partitioned GRIB records 
occur when a single global grid is partitioned into 
multiple records so that a particular record does not 
exceed some arbitrary length. 

In the case of station data, GrADS currently 
implements its own station data format. The GrADS 
station data format stores the data in an IEEE 
floating point binary format. The file is structured 
into reports, where each report contains a header 
(which describes the station identifier and the 
latitude, longitude, level, and time of the data) and 
one or more collections of data. Multiple levels of 
data may be contained in one report. In a way 
similar to the handling of GRIB data, a utility is run 
to index the station data. 

GrADS can treat multiple data files as a single 
logical GrADS file, when the data files are split in 
the time dimension. For example, given monthly 
samples over a period of years, each month could 
be stored in a separate file, or each year could be 
stored in a separate file (12 months per file). 

GrADS determines which file contains what time 
periods by parsing the file name. The user specifies 
the file name using a naming convention expected 
by GrADS and specified as a file template in the 
data description file. The template specifies which 
temporal elements (year, month, day, hour, and 
forecast hour) are used and in what order they are 
used in the names of files to be treated as a logical 
unit. When a data access is requested, the date/time 
of that access is used to fill in the full file name. 
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The data are then read from that particular file. No 
more than one data file is kept open at a time, 
which avoids exceeding the open file limit that 
exists on most operating systems. If a particular file 
is missing, GrADS views it as missing data, and 
when displayed, it is handled as though the file 
exists but is filled with missing data values. 

The file name template is particularly useful 
when handling real-time data. New files can be 
created and old files archived. Only the description 
file needs to be changed (to reflect the new time 
extent of the GrADS file), and utilities are provided 
to allow the time extent of a data description file to 
be updated “on the fly.” If the real-time data system 
should fail for a period of time, then the data files 
for that period would be missing. No special 
handling is required, because GrADS will automati- 
cally assume that the data is missing and handle it 
appropriately. 

5. OPERATIONS ON DATA 

Operations may be performed on data in 
GrADS by entering an expression. Expressions 
consist of combinations of operators, variables, and 
functions. A given expression operates over the 
range of the current dimension enviro n ment. 

Operators within a GrADs expression are +, -, 

*, and /, and the operations are addition, subtrac- 
tion, multiplication, and division, respectively. 
Operations take place over the entire range of the 
operands, which are variables, functions, or con- 
stants. An example of a GrADS expression is: 

ts*9/5+32 

In this example, a variable called ts is being 
converted from Centigrade to Fahrenheit. It is 
beyond the scope of this paper to document the 
entire syntax of the expression parser, but simple 
examples will be shown to illuminate specific 
points. 

V ariables represent “raw” data, which are 
contained either in a data file or in memory (a 
define command allows users to move data 
subsets to memory for more efficient manipulation). 
Variable names are specified by the user, either 
within the data description file (where GrADS 
variable names are assigned to data within a data 
file) or when the data are moved to memory. 


The range of the data for a given variable is 
assumed, by default, to be the range of the current 
dimension environment. Within the expression, the 
user may override the current dimension environ- 
ment. This allows, for example, a time difference of 
a variable to be easily calculated: 

ts-ts ( t — 1 ) 

In this example, the variable name is ts. A differ- 
ence is being calculated between that variable at the 
fixed time specified by the dimension environment 
(the default) and the same variable at an offset from 
the time specified in the dimension environment (in 
this case, one time step earlier). 

Multiple data files may be opened and variables 
accessed from any of those open files. An example 
of an expression that operates on data from differ- 
ent files is: 

( ts . 2-ts . 1 ) *9/5+32 

In this case, the variable ts is being differenced 
between two separate files (“.2” refers to the second 
open file while “.1” identifies a variable from the 
first open file) before being converted to Fahrenheit. 

Functions are also supported as part of the 
expression parser. Built in functions are provided to 
perform a variety of basic mathematical and statisti- 
cal functions and to calculate simple meteorological 
variables, such as vorticity. Some functions can 
operate over a large range of data, such as the 
averaging function. Arguments to this function 
specify the dimension range over which the averag- 
ing is to be done. A simple nested expression could, 
for example, perform a temporal and zonal average 
over several data sets. Such an expression could 
process several gigabytes of data. 

Another interesting function is maskout. This 
function takes two arguments. The operation of the 
function is such that the first argument is masked by 
the second and becomes the result. Whenever the 
second argument is negative, the corresponding 
data element in the result is set to missing. A 
typical application is to mask based on a land/sea 
grid. A maskout function, nested in an averaging 
function to calculate a global mean, could yield, for 
example, a time series of a variable averaged 
globally over only ocean points. 
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GrADS also supports functions written by 
users. Such functions operate externally to GrADS. 
These external functions are described via a User 
Defined Function Table (UDFT). This table is 
scanned when GrADS is first started, and functions 
described in the table are made available to the 
expression parser. When an expression is entered 
that uses such a function, GrADS goes through the 
following steps: 

• The arguments to the function are evalu- 
ated. If the arguments are expressions, the 
expressions are evaluated and the 
results obtained. 

• The value of each argument is written in a 
file in a documented format. 

• The user defined function is invoked as a 
separate process. This program may do any 
processing desired, including invoking 
GrADS recursively, reading GrADS data 
files, and so forth. 

• When processing is complete, the function 
writes its result into an output file, which is 
also in a documented format. The function 
then terminates execution. 

• GrADS reads the result of the function 
(from the output file) and continues the 
evaluation of the expression. 

Several functions are provided with the GrADS 
distribution as examples. One such function inter- 
polates from any arbitrary grid spacing to a differ- 
ent grid spacing. This allows operations to be 
performed on grids that have different spatial 
resolution. 

6. GRAPHICS OUTPUT 

The result of a GrADS expression can be 
displayed using a variety of standard graphics 
output techniques. One-dimensional displays 
include line graphs and bar charts. Two-dimen- 
sional displays include contour plots (either level 
curves or shaded regions), wind vector or wind barb 
plots, streamlines, grid box values, and station 
model plots for station data. All vector displays 
(vectors, barbs, and streamlines) may be colored 
using any scalar field (which may itself be the 
result of a GrADS expression). 

The user has a wide variety of options to 
control the way graphics are displayed. These 
options default to geophysically desirable values. 

Of particular note is the ease of control over the 
colors of contours, shaded contours, wind barbs, 
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wind vectors, and streamlines. The different graph- 
ics types may be easily overlaid to create complex 
graphics showing relationships between several 
different variables (Figure 1). 

It is important to note that the dimension environ- 
ment determines the range of the display. We will 
explain this concept with an example. Suppose the 
dimension environment is defined as follows : 

set Ion -180 0 
set lat 40 
set lev 500 

set time 06decl982 lldecl982 

The Y and Z dimensions are fixed, while the 
varying dimensions are X and T. Thus, the axes of 
the output plot are X (longitude) and T (time) over 
the ranges specified (Figure 2). Annotation of the 
axes is automatic and appropriate to the dimensions 
selected for display. If the result of the expression 
reduces the number of varying dimensions (such as 
applying the averaging function), the display would 
be a one-dimensional plot, such as a line graph, 
where the varying dimension is X or T (Figure 3). 
Note that it is valid for an operation on data to 
reduce the number of varying dimensions, but it is 
invalid for an operation to increase the number of 
varying dimensions or to change which dimensions 
are varying. 

Any graphics displayed on the screen can also be 
printed. The graphics are output in vector form to an 
intermediate metafile format, and they can then be 
translated to monochrome or color postscript using 
GrADS utilities that are provided with the GrADS 
program. Becase the postscript files are vector graphics 
based, they will be rendered at the printer using the full 
resolution of the particular printer. 
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Figure 1. Map displaying several variables at once. The level curves depict temperature with red tones for high values and blue 
tones for low values. The shaded contours represent relative humidity, with brighter green for higher values. Winds are depicted 
using standard meteorological wind barbs. 
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Figure 2. Longitude (abscissa) time (ordinate) diagram of500-hPa geopotential height. 
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Figure 3. Line graph of 500-hPa geopotential height averaged by GrADS over a range of times at a fixed latitude 
displayed as a function of longitude. The open circles represent gridpoint locations, and the bars represent standard 
deviation (computed by GrADS) about the mean of the times being averaged. 
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A number ofvisualization techniques are discussed in a 
laboratory experiment designed to study phenomena 
that occur in space. Visualization tools are used todesign 
the apparatus, collect data, and make one-, two-, and 
three-dimensional plots of the results. These tools are 
an indispensable part of the experiment because the 
data sets are hundreds of megabytes in size and rapid 
turnaround is required. 


1. INTRODUCTION 

As a result of advances in general technology, it 
is now possible to perform laboratory experiments 
relevant to space plasma physics that were not 
possible 15 years ago. These advances include 
developments in computers, digitizers, and other 
hardware, as well as large improvements in the 
technology of plasma devices and plasma sources. 
The increased sophistication of laboratory tech- 
niques allows laboratory studies of the fundamental 
physics of space plasma processes . The relevance 
of laboratory studies to space plasmas is ensured by 
keeping constant ratios of dimensionless param- 
eters. A successful laboratory study of a space 
plasma process involves selecting a particular 
phenomenon important to space plasmas that can be 
reproduced in a laboratory device in a scaled sense 
and performing measurements of its space and time 
evolution. Laboratory experiments can probe a 
process with unprecedented detail and uncover 
effects that may not easily be detectable in space. 

Plasma sources now exist that can produce 
quiescent and highly reproducible plasmas .These 
sources have been developed to allow experiments 
in which highly detailed space-time measurements 
of electromagnetic fields and plasma parameters 
can be routinely collected. Understanding and 
control of the plasma boundaries have developed. 
Plasma properties, such as density gradients, 
striations, collisionality, magnetic field profiles, ion 
composition, and percentage of ionization, can be 
tailored for investigations of an individual process. 

The development of laboratory plasmas alone 
would not allow the revolution in space laboratory 
experiments to occur. The accompanying strides in 
data acquisition [Gekelman, 1992] and workstations 
make possible the data collection and analysis 
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needed to process the experimental measurements. 
The advances in data acquisition technology 
include fast, and relatively cheap, workstations that 
have enormous amounts of ram and disk storage, 
fast ( > 1 GHz ) analog to digital converters, digital 
oscilloscopes, and programmable function genera- 
tors that can be interfaced to computers. The result 
of applying sophisticated acquisition techniques to 
laboratory experiments is data sets in excess of 100 
Mbytes. In a decade, laboratory data sets may 
exceed 1 terabyte. It is clear that without the proper 
analysis and visualization tools, understanding the 
experimental measurements would not be possible. 

This paper points out how and why experimen- 
tal data sets have grown over the last decade. We 
then illustrate the many ways in which a modern 
laboratory has become dependent on visualization 
in its everyday operation. Finally, we attempt to 
predict what the capabilities of a plasma physics 
experiment and its associated theoretical effort will 
be 10 years from now, as well as list the demands it 
will place on future visualization systems. As this 
book attests, there are many groups involved in 
plasma physics that use visualization techniques. 
Because this is not a review article, we cite ex- 
amples from work done at UCLA as illustrations. 
The other papers demonstrate how widely visual- 
ization techniques are used in the space plasma 
physics community. 

2, THE LAPD EXPERIMENTAL 

CONFIGURATION 

Twenty years ago, nearly all plasma physics 
experimental data were in the form of x-y plots and 
tables. Digitizers, as we know them today, did not 
exist. Transient data were captured by photograph- 
ing oscilloscope traces while sampled or continuous 
data were recorded with plotters. For example, if a 
waveform was recorded as a function of time at a 
fixed position, and the receiver was moved to other 
positions, the motion of a phase point could be 
laboriously measured using a ruler. When it is 
necessary to acquire two- or three-dimensional data, 
the data collection and analysis must be automated. 
We will illustrate this with data taken from the 
LAPD (LArge Plasma Device) at UCLA. The 
LAPD [Gekelmanet al., 1991] is a flexible, low- 
maintenance device designed to study a variety of 
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waves and nonlinear effects in fully magnetized 
plasmas. The plasma column is 50 cm in diameter 
and 10 meters in length. The magnetic field is 
controlled by seven independent supplies, which 
allow tailoring of the axial magnetic field profile. 
The maximum DC axial field is 2.5 kG which gives 
a highly magnetized plasma (a He plasma column is 
500 ion gyroradii wide). 

The plasma is produced by a DC discharge 
driven by an oxide-coated cathode. This type of 
plasma has proven quiescent (8n/n = 5%), and 
oxide-coated cathode sources are stable for long 
periods (= 6 weeks ), giving plasma discharges that 
are very reproducible from shot to shot. The plasma 
(H, He, Ar, Xe, Kr, Ne, or any mixture) can be 
moderately high in density (n < 5 x 10 12 /cm 3 ), and 
at the LAPD laboratory, we have achieved very 
highly ionized helium plasmas, with the percentage 
of ionization about 99 percent [Maggs et al., 1991]. 
The discharge is pulsed at several Hz to allow for 
efficient signal averaging and data processing. The 
plasma is diagnosed using a completely internal 
probe drive capable of moving, under computer 
control, an assemblage of magnetic and electric 
field sensors to any position within the plasma 
volume [Pfisteretal., 1991]. 

A volume data set is acquired by measuring the 
three components of the electric and magnetic fields 
at up to 32K time steps at a given location. 
Langmuir probes are used to measure the plasma 
density, plasma potential, and electron temperature 
at up to 1 6 times during a shot. The Langmuir probe 
is rapidly swept, and the probe current and voltage 
are digitized. Housekeeping functions, such as the 
neutral gas pressure and cathode temperature, are 
also digitized and stored. The probe is moved to the 
next position in the volume, and more data are 
taken. The data run could store results at 10K 
positions, resulting in data sets of 300 Mbyte. If 
experimental conditions are changed, different data 
sets are generated, each of which must be analyzed 
rapidly so that interesting features can be isolated 
and future data runs effectively planned. 

Of paramount importance in a well-diagnosed 
basic physics experiment is a state-of-the-art data 
acquisition system. The LAPD laboratory has 
assembled hardware and software that enable real- 
time storage and analysis of very large data sets 
[Gekelman and Xu, 1986]. The system is flexible 
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enough to perform real-time manipulations of 
incoming data and provide rapid turnaround for 
data reduction and display. The front-end of the 
system consists of a Y AX station 3900, which 
controls both CAMAC and GPIB systems in which 
analogs to digital converters, programmable 
amplifiers, stepping motor controllers, and so on, 
are located. The acquisition system is, in turn, 
networked to a Kubota Titan Super Graphics 
Workstation, DEC station 5000’ s, and an SGI 240/ 
VGX. These computers have hardware that can 
render three-dimensional images and rotate and 
zoom in real time. This capability is indispensable 
for visualizing the complex data sets generated 
from the types of experiments conducted at the 
LAPD laboratory. The workstations have up to 128 
Mbyte of RAM and a total of 9 Gbytes of disk 
storage. They run the latest in visualization 
software, such as AVS, Data Visualizer, PV-Wave, 
and Advanced Visualizer. In addition, our group 
has written many data analysis programs, as well as 
all the data acquisition software. 

The local laboratory network is linked with 
fiber optics to the rest of the UCLA campus, to 
access other research computers, and to a large 
variety of external networks. The most important 
link connects the LAPD with the UCLA 
Visualization Center, which was constructed, in 
part, by one of the authors (Gekelman). The key 
piece of equipment in the center is an Abekas A-60. 
This device will store 50 seconds ( 1 ,500 NTSC 
frames) of digital video; data can be loaded to 
across the network. The Abekas may be 
programmed to play the data back in segments onto 
a Sony Betacam-SP video recorder. The segments 
can be of any length and played back at any speed. 
The Betacam is used for a master recording. Video 
movies may then be transferred to VHS or Super 
VHS in any format (NTSC, PAL, SECAM). This 
computing power is indispensable for visualizing 
the complex data sets generated in the modern 
plasma laboratory. In the past year, we have come 
to depend on making videos quickly (overnight) to 
understand the data and alter ongoing experiments. 
Some of the real-time aspects of this analysis will 
be presented here. Our experiment and analysis 
tools have become so interdependent that we 
consider visualization an integral part of our 
laboratory. 


3. GRAPHICS PACKAGES 

The appearance of flexible and easy-to-use 
graphics packages is a recent phenomenon . Fifteen 
years ago, people wrote their own graphics pack- 
ages, which were highly device dependent. Graphi- 
cal output was often screen dumps from the termi- 
nal and line printer output. For want of video 
technology, 1 6-mm movies were made to illustrate 
three-dimensional effects. Because of the difficulty 
involved in their production, movies were made 
only for presentations at scientific meetings. For 
example, a movie made by one of the authors 
(Gekelman) showing the tearing of a current sheet 
[Gekelman et al., 1989] took about 2 months to 
complete. The process involved passing the data 
through a public domain three-dimensional contour- 
ing program (GRAPE), translating it from VAX 32- 
bit format to CRAY 64-bit, and inputting the 
contours to a surface builder and Tenderer written at 
the San Diego Supercomputer Center. Vector fields 
were treated with an algorithm written at UCLA, 
which drew three-dimensional vectors on the 
rendered data. The project consumed 1 20 hours of 
CRAY-YMP time and took 2 weeks to render. The 
final product was high-quality 1 6-mm film, but 
once a 16-mm film is made, there is no way to 
change it. 

That example is in marked contrast to what can 
be done today. Data are collected and moved across 
the network to a graphics workstation where they 
are translated from VAX to UNIX binary in a 
matter of minutes. The data are then rescaled and 
transformed from the probe to the laboratory 
coordinate system. A data set can be ready for 
rendering in several hours. A 1 ,500-frame video is 
programmed (selecting views, titling, and 
colorizing) in an hour and can be rendered over- 
night on one of our graphics workstations. It takes 
about 2 hours to transfer the frames to the Visual- 
ization Lab Abekas (this is an ethernet network 
bandwidth limitation) and another 1 0 minutes to 
make the video. In the future, we envision the entire 
process to be limited only by the time it takes to 
choreograph the video ! 

Table 1 lists some of the graphics packages 
used by the authors, as well as the source of the 
package and its principal use. There are, of course, 
other packages. 
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Table 1. Graphics packages used by the authors. 


Package 

Author 

Symbols 

Use 

AVS 

AVS INC. 

$$,N,I,T,A,M,nw,U,RT 

3-D, 2-D, data analysis, network 
based 

Data Visualizer 

Wavefront 

Technologies 

$$,N,I,T,A,m,nw,U,r, 

RT* 

3-D, 2-D data analysis, very easy to 
use 

Advanced Visualizer 

Wavefront 

Technologies 

$$$,I,nw,A,r,RT,T 

3-D, model building, animation 
standard for film industry 

■SSTSRSRifSnHIifl^BHMK 

Adobe 

$,I,u,T 


Vellum 

Ashar 

$,I,m,T 

2-D, 3-D drafting 

PV-Wave 

Visual Numerics 

$$,N,I,M,T,A, nw,U,r 

1-D, 2-D fast data analysis, 
contouring 

TVSOlib 

NERST 

Pd 

1-D, 2-D wireframe, contours 

movie BYU 

Brigham Young 
University 

Pd,T 

1-D, 2-D surface, contours 

NeoVisuals 

Neovisual Inc. 

$$,RT 

first decent ray tracer 

Snapshot 

Silicon Graphics 

N,I,T 

paint program with image processing 

SDCS Image Tools 

SDSC 

PD,N,I 

image translator 


XAOS Tools 

$$,T,RT 

ultimate titling 

Dore 

Kubota Pacific Inc. 

$,I,T,A,r 

real-time 3-D animation 


Key to Symbols 


$ = moderate in cost less than $1K 
$$ = expensive less than $4K 

$$$ = very expensive more than $4K (sometimes up to 
Pd = public domain, usually price of media 
N = new, less than 5 years old 
nw = can be run over a network 
RT = has a ray tracer 

r = Tenderer (Gourard or Phong) Some programs such 
as A VS and the Data Visualizer take advantage of 
the host computer’s specialized hardware. 


I = interactive 

m = has some mathematical functions 
80K) M = has extensive mathematical functions 
T = has some titling capabilities 
A = can make animation 
u = user may add own modules 
RT* = will use ray tracer of the Advanced 
Visualizer (if you have it) 

SDSC = San Diego Supercomputer Center 


4. EXPERIMENT SUPPORT 

The remaining part of this article illustrates 
how visualization tools are used in almost every 
step of our laboratory’s operation. CAD tools are 
routinely used to prepare drawings that can be 
submitted to machine shops. We have used 
Vellum; it is very easy to use, and we make high- 
quality drawings of everything we build. 
Previously, we were content with sketches. 

Apart from designing instruments, visualization 
tools are sometimes essential in understanding their 
measurement capabilities. One example of this is in 
describing the transmission function of a 
directional velocity analyzer. These devices are 
constructed by interfacing a collimator (array of 
narrow channels) and a conventional velocity 
analyzer [Stenzel et al., 1982; Stenzel et al., 1983; 


Leneman et al., 1991]. The behavior of this 
instrument is straightforward in an unmagnetized 
plasma, but difficult to describe in a magnetized 
plasma because of the helical nature of the particle 
orbits. To understand the operation of the 
directional velocity analyzer in a magnetized 
plasma, a computer program was written to 
calculate the instrument sensitivity as a function of 
perpendicular and parallel particle velocity for a 
given species (ion or electron) and background 
magnetic field strength. The numbers generated are 
then rendered as a surface whose height above a 
point in the Cartesian plane represents the 
sensitivity S(vn,vj_) of the analyzer. The surface 
shown in Figure 1 was colored to help easily 
identify changes in the sensitivity. The axes were 
added afterwards using Photoshop on a Macintosh 
Quadra. 
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Figure I. Instrumental sensitivity, S(va, v ||.), of a directional velocity analyzer shown as a function of the components of the velocity of 
the detected particle perpendicular and parallel to the local magnetic field. The red areas show regions of heightened sensitivity, and the 
blue areas show sensitivity too low to be useful experimentally. Black represents the region in velocity space from which no particles can 
enter the velocity analyzer. 


5. TWO-DIMENSIONAL DATA 

Two-dimensional data may be rendered as 
contour maps or color-coded surfaces (or a combi- 
nation of the two). As an example, we show data 
from an experiment on whistler waves. Figure 2a 
shows wave phase fronts in a plane that contains the 
background magnetic field. The plasma density is 
n = 2.0 x 1 0 1 '/cm 3 , the whistler frequency is f W ave = 
80 MHz, and the ratio of the wave frequency to the 
electron cyclotron frequency is f W ave/fce = -07. The 
red areas are wave magnetic field maxima, and the 
blue are minima. 

Generally, those involved in an experiment 
know where all the relevant probes, antennas, and 
boundaries are located. However, when communi- 
cating results to others not familiar with the geom- 
etry, the ability to render relevant components of 
the apparatus on actual data is extremely helpful. 
This is illustrated in Figure 2a, which depicts the 
exciter at the left, and Figure 2b, which shows the 
whistler data as it is located in the device. The 
orange square at the rear is the oxide-coated cath- 
ode, which is the plasma source. The circular lines 


show the boundary of the vacuum vessel. The 
exciter and data plane are in the midground. The 
non-data objects may be built by calculating all 
points on their surface and giving them a single 
scalar value; A VS then renders them as an 
isoscalar. Very complicated objects can be built 
with the Advanced Visualizer and then exported to 
AV S or the Data Visualizer. We have found that, 
when the data are complicated, placing objects in 
three-dimensional renderings is useful. Obviously, 
these techniques will soon be used for spacecraft 
and satellite measurements, where complicated 
boundaries and measurement geometries exist. 

Integrating three-dimensional data with digi- 
tized images is also useful for presenting the 
perspective of scale sizes. A cutaway view of the 
device with magnetic field data, B y . placed within it 
is shown in Figure 3. The wave magnetic field 
shown is that of a shear Alfven wave. Data were 
acquired on 10 parallel planes. The wave maxima 
are colored red, and the minima are blue. Green 
areas have zero wave magnetic field. One can see 
that B v is largest in a field-aligned filament. The 
wave data were rendered with the Data Visualizer, 



226 


Visualization Techniques in Space and Atmospheric Sciences 



Figure 2a. Data of whistler wave magnetic field intensity is shown as a function of position. The waves were launched in a uniform 
plasma from a trombone-shaped antenna, which has been rendered in the drawing on the left. Superimposing the exciter helps orient 
the viewer. The broad wave pattern refects the shape of the exciter. 



Figure 2b. A plane of wave data is rendered, to scale, with important components of the experimental device. Shown is the exciter (in 
white), the cathode t the orange square at the rear with a darker orange, circular emitting area ), and the walls of the device ( white 
circles ). A white measurement grid also is superimposed. 
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Figure 3, A cutaway view of the LAPD with measured Alfven waves in it. The Data Visualizer was used to generate a series of 
10 cutplanes, on which one component of the wave magnetic field ( B v ) is rendered. Red is used to denote wave maxima and blue minima. 
The image was imported to Photoshop, along with an image of the device output from a slide scanner. A part of the machine slide was 
cut away and the data image superimposed. This technique allows those unfamiliar with the geometry of an experiment to become 
instantly acquainted with it. 


and Photoshop was used to process the digitized 
image and superimpose the data. 

These same techniques are indispensable to 
theoreticians. Figure 4 shows a two-dimensional 
surface representing the spatial evolution of the 
magnetic field-aligned velocity distribution of 
electrons interacting with the electric field excited 
near plasma resonance in a non-uniform plasma. 
The initial electron distribution consists of a low- 
temperature spatially non-uniform part and a hot 
uniform Maxwellian tail population. The surface is 
color coded and textured to help identify features. 
Red represents high particle density and blue low 
particle density. The high-energy tail population 
interacts with the resonant electric field to form a 
beam-like velocity distribution through a process of 
phase-independent acceleration [Maggs and Mo- 
rales, 1990], 

The surface was produced using AVS with data 
generated by a computer calculation of the velocity 
distribution using an array of test particle orbits. 
Representation of the data in this fashion reveals 


details of the spatial evolution and helps illuminate 
the process of beam formation. Such a process may 
produce bursts of superthermal electrons in the 
Earth’s ionosphere and is a candidate for experi- 
mental study in the LAPD. 

6. VISUALIZATION TOOLS IN REAL-TIME 

DATA ACQUISITION 

In laboratory experiments that generate volume 
data, real-time feedback between the data and 
experiment is a necessity. It is disappointing, as 
well as wasteful, to spend days acquiring an enor- 
mous data set that is uninteresting. Furthermore, 
unexpected effects (which plasmas seem quite 
willing to provide) sometimes result in having the 
effect of interest spill out of a preprogrammed 
measurement volume. A plasma wave experiment 
involves varying one or more parameters in a 
discrete manner and making measurements at each 
point in the parameter space. The spatial dimen- 
sionality of the measurements may range from a 
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Figure 4. The theoretically calculated velocity distribution, f( v), of electrons interacting with an electric field at plasma resonance 
in a non-uniform plasma is shown as a function of velocity and space. Red indicates high density and blue low. The distribution starts 
out as a Maxwellian, but evolves into a distribution with several beam-like features after interacting with the wave. The beam-like 
features are indicated by the local maximums in the distribution on the surface nearest the viewer. 


single point (zero-dimensional) to a full volume 
(three-dimensional). A three-dimensional grid on 
which data are acquired can have as many as 
1 0,000 positions. The fixed parameters for a given 
run include the wave-launching antenna position, 
wave frequency, average plasma density, back- 
ground magnetic field, and so on. Variables taken at 
each point can be the local magnetic field, electric 
field, plasma density, and electron temperature. 

Measurements consist of digitized signals from 
probes inserted into the plasma. It is a simple matter 
to display these signals as a function of time as they 
come in. However, one is more interested in the 
variation of the data as a function of the fixed 
parameters and of position and time. Quantities 
such as plasma density or electric field must be 
extracted from the digitized signals. In general, 
significant processing is required to extract the 
quantities of interest. Basic processing includes 
offset removal, coordinate transformation, and other 
simple mathematical operations. More complex 
analysis includes curve-fitting, digital filtering, 


correlations, interpolation, and so on. Statistical 
analysis also is sometimes desired. Then, depending 
on the dimensionality of the parameter space and 
the spatial dimensionality, the data will be dis- 
played using the appropriate graphics package. 

To avoid slowing the data acquisition computer, 
all of the post-processing runs independently on 
two separate workstations. One is devoted to 
analysis, the other to visualization. They share 
access to a large (-1 Gbyte) region of disk memory. 
In an experiment involving the interaction of a 
whistler wave with a density striation, the electric 
and magnetic fields, as well as the plasma density, 
are measured for a given incident whistler wave 
frequency. The measurements are made on a series 
of planes that are perpendicular to the background 
magnetic field. The data acquisition system begins 
by setting the frequency and moving the probe head 
to the first position in the data plane. Each data 
component is digitized and stored for a number of 
plasma shots (typically 1 0). After all shots have 
been taken, the data acquisition system updates a 
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status file, which contains information such as the 
name of the current data set and the number of data 
records it contains. Then it moves the probe to the 
next position, and acquisition continues. When the 
plane has been mapped, the next wave frequency is 
set, a new data file is opened, the probe is moved 
back to the first point in the plane, and the process 
is repeated. 

Concurrently, the analysis process, which runs 
on a separate workstation, periodically checks the 
status file; if it finds that new data have arrived, the 
workstation proceeds with the analysis in the 
following manner. New data records are appended 
to a backup copy of the current data set. Then they 
are split into electric field, magnetic field, and 
plasma density components. The calculation then 
proceeds on two separate tracks. The current- 
voltage response, as measured by a Langmuir 
probe, contains density, potential, and electron 
temperature information. T o extract these from the 
digitized Langmuir probe data, a curve-fitting 
procedure must be used. The electric and magnetic 
wave field data are analyzed quite differently. For 
each component of these data, the wave amplitude 
and phase are calculated from correlations with the 
input signal. When both types of calculation are 
complete, the median values of plasma density and 
the wave amplitudes and phases are stored. The 
median values are appended to a display file, and 
the calculation status file is updated to signal the 
visualization process that there are new data to be 
displayed. 

The Application Visualization System (AVS) is 
used to display the data. To use AVS, one connects 
a number of simple program modules together to 
form a flow network. One module periodically 
checks the calculation status file for the existence of 
new data. If new data exist, other modules in the 
network are signaled to update the display. One 
component of the wave data is color coded and 
plotted as a two-dimensional surface. A contour of 
plasma density, which reflects the position of the 
density striation, is superimposed. The manager 
module also can be used to change which compo- 
nent is being displayed, and change data sets (i.e., 
different input frequencies), contour level, and so 
on. The time variation of the wave fields can be 
reproduced by advancing the wave phase in small 
steps. No matter how the display parameters are 


changed, the picture is continually updated with 
newly analyzed data. 

It takes on the order of 1 minute to accomplish 
all the probe motions and acquire and store all the 
shots of data at a single spatial location. Acquiring 
data over a full plane with decent resolution (30 by 
30 positions, for example) requires approximately 
15 hours. The analysis takes about 10 seconds per 
point, for a total of approximately 2.5 hours. Real- 
time display of data is critical for the experimenter 
to be able to make adjustments to experimental 
parameters and correction to the experimental setup 
in a reasonable amount of time. It is also extremely 
useful as a tool to enable a deeper understanding of 
the physics being measured and to guide the deci- 
sion as to the experimental directions to be pursued. 

7. THREE-DIMENSIONAL DATA 

Data from plasma physics experiments usually 
consist of vector fields [E(r,t), B(r,t)] and scalar 
fields [n(r,t), T e (r,t), V p (r,t)]. Three-dimensional 
scalar fields may be plotted as isosurfaces (i.e. , 
surfaces on which the quantity has a fixed value). 
Vector fields may be plotted as arrows. Sometimes, 
a large array of three-dimensional vectors can be 
confusing. One can then combine both techniques 
to plot a vector field. This is shown in Figure 5, 
which is experimental data of the tearing of a 
current sheet [Gekelman and Pfister, 1 988] . In this 
work, an initially uniform slab of electron current 
was forced to flow in a magnetoplasma. When the 
current sheet was much longer than its height, it 
tore into a three-dimensional array of magnetic 
islands and X type neutral points (locations at 
which the transverse magnetic field vanishes). To 
avoid confusion, the axial component (j y ) of the 
current was represented as a series of isosurfaces of 
varying transparency and color. These surfaces 
were parameterized by the axial current density. 

The transverse current was represented as a vector 
(jxdz) plot. In this experiment, there was a back- 
ground magnetic field in the y direction. One sees 
that structure has developed in the axial current. 

The locations at which j z is zero occur at magnetic 
X points. 

Finally, there is a class of three-dimensional 
data that a single image cannot capture. This is 
illustrated in a rendering (Figure 6) of two magnetic 
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Figure 5. Three-dimensional sutf aces of axial current density J y (orange-l. 2 A/cm2, green 1.0A/cm2, and blue 0.5 A/cm2) are shown at t= 4.7 5ms 
aftertheformation of a thin ( widthAength =0.1 ) current sheet. The magenta arrows indicate current flow perpertdicularto the background magnetic 
field. The yellow rectangle in theforeground shows the position of the slot through which the current is funneled att=0. 


flux tubes that have become intertwined. The 
experiment in which this occurred involved the 
interruption of a plasma current [Gekelman et al., 

1 987], The initial magnetic field lines caused by the 
plasma current are circles surrounding the current 
channel. When the current is forced to break up into 
eddies and return currents, the structure of the field 
lines and their associated flux tubes become com- 
plex. The two flux tubes shown here, arbitrarily 
colored red and blue, are linked in a complicated 
way. Unless the image is printed as a stereo pair (by 
plotting two images as seen by observers six 
degrees apart — the average angular separation of 
our eyes), it is not possible to determine whether 
the flux tubes are linked. Such a stereo pair was 
printed in the reference cited along with another 
flux tube pair, equally as complex but not linked. 


Without the use of stereo viewing, there is no way 
to tell them apart. An increasing number of work- 
station vendors supply hardware that allow their 
systems to show color stereo images. This is 
accomplished by alternately placing each of the 
image pair on the screen and synchronizing electro- 
Polaroid glasses with it. The stereo viewing process 
is greatly aided by machines fast enough to render 
images in real time as they are rotated. In some 
cases, a non-stereo system that can rotate 
isosurfaces in real time may suffice. 

8. DEVELOPMENTS IN THE NEAR FUTURE 

Ten years ago, visualization tools were used 
mainly to present results to scientists at meetings 
and seminars. With the increase in diagnostics and 
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Figure 6. Two flux tubes, created when a plasma current flowing along a background magnetic field is disrupted by switching on a strong 
orthogonal magnetic field, are shown in blue and red. The magnetic flux tubes are initially circular and surround the undisrupted current. 
After the current is interrupted, the flux tubes become complicated and intertwine. A stereo view is necessary to determine whether the flux 
tubes become linked. 


digitization techniques, they are now an integral 
part of the running experiment. Graphics are used to 
display data as it is taken and for rapid analysis of 
large data sets at intervals within an experiment. 
Images that used to be time and labor intensive to 
create and were only reserved for figures in publi- 
cations are now used as graphical scratch pads and 
“quick looks.” 

One of the limitations of probes in laboratory 
plasmas is their size. Because they are many Debye 
lengths (and sometimes electron Larmor radii) 
across, they perturb the plasma they are measuring. 
Laser diagnostics are non-perturbative but, because 
of expense, have been limited to only a few chan- 
nels of data acquisition. This is changing. Fabrica- 
tion techniques will enable the production of 


microscopically small probes. The price of lasers is 
dropping, and inexpensive tunable solid state lasers 
are emerging. Along with these advances, there has 
been a steady drop in the cost of high-speed analog 
to digital converters. It is not hard to picture future 
laboratory experiments with hundreds of thousands 
of data channels generating up to 1 00 terabytes of 
data per run. Such experiments will be necessary to 
investigate chaotic and turbulent systems in a 
systematic way. These problems will be attacked by 
workstations of the future that will be armed 
gigabytes of ram and terabytes of disk or other 
inexpensive storage. It is obvious that visualization 
tools and scientific data base management tech- 
niques will be indispensable for every step of the 
process. Coupled to these hardware advances will 
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be innovative ways to present data. This could be 
done, for example, by allowing others to access 
animations of results over high-speed networks. 
Eventually, cellular technology will enable these 
networks to export films and journal articles to 
laptop computers. Today ’ s means of presenting 
scientific information will soon become obsolete. 
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Envision is a software project at the University of 
Illinois and Texas A&M, funded by NASA ’s Applied 
Information Systems Research Project. It provides 
researchers in the geophysical sciences convenient 
ways to manage, browse, and visualize large observed 
or model data sets. Envision integrates data manage- 
ment, analysis, and visualization of geophysical data 
in an interactive environment. It employs commonly 
used standards in data formats, operating systems, 
networking, and graphics. It also attempts, wherever 
possible, to integrate with existing scientific visualiza- 
tion and analysis software. Envision has an easy-to- 
use graphical interface, distributed process compo- 
nents, and an extensible design. It is a public domain 
package, freely available to the scientific community. 


1. INTRODUCTION 

Envision is an interactive software package for 
the analysis and display of measured and modeled 
datasets [Searightet al., 1993a; Searight et al., 
1993b], The key features of Envision are support 
for data in NetCDF and HDF files, an easy-to-use 
X/Motif user interface, distributed process capabili- 
ties in a client-server arrangement, and portability 
to many UNIX workstations. The Envision pack- 
age also provides the user new ways to view and 
change metadata in a set of data files. It permits a 
scientist to manage conveniently and efficiently 
large data sets consisting of many data files. It also 
provides links to popular visualization tools so that 
data can be quickly browsed. 

1.1 Motivation 

Earth science researchers work with increas- 
ingly larger data projects. They work with both 
large observational data sets and large model 
outputs. The Earth Observing System Data and 
Information System (EOSDIS) illustrates the 
magnitude of the data explosion in the earth sci- 
ences. EOSDIS will produce a large variety of data 
sets, ultimately containing more than 10 petabytes 
(10 15 ) of data (M. Folk, NCSA, personal communi- 
cation, 1993). To carry out successful research on a 
project such as this, scientists need to be able to 
browse, visualize, and analyze large and varied 
earth science data sets . 

There are a number of problems to address in 
designing software to work with large data sets. 
When dealing with large data sets, the issue of data 
management becomes critical. Another important 
consideration is the handling and use of metadata, 
or data about the data. Earth scientists have to deal 



234 


Visualization Techniques in Space and Atmospheric Sciences 


with data sets composed of many files and need 
interactive, random access to any part of them as 
transparently as possible. Data subsets should be 
easily and quickly displayed with a range of visualiza- 
tion options for analysis purposes. Researchers must 
also be able to integrate their own visualization and 
analysis programs with a data management system. 
These concerns form the basis for the design and 
implementation of the Envision package. 

1.2 Project Goals 

Envision was created to address the lack of 
effective tools to manage and browse large scien- 
tific data sets. Clearly, there are other software 
packages in existence that offer partial solutions to 
these problems. So, rather than starting from 
scratch, Envision was designed to take advantage 
not only of existing standards in file formats, 
hardware, and software, but also to interface with 
existing packages and software libraries, where 
possible. Envision’ s goal is to integrate the areas of 
data management, analysis, and visualization 
together in an interactive environment. The 
project’ s design is modular and distributed to allow 
for flexibility and extensibility, but it also tries to 
hide from the user unnecessary detail about the 
storage of data values in files. To achieve wider 
use, Envision is a public domain package that will 
run on most UNIX workstations. 

2. SYSTEM ORGANIZATION 

2.1 Requirements 

The Envision package is designed to work with 
commonly used standards in data formats, operating 
systems, networking, and graphics. It runs on many 
UNIX workstations using X/Motif, including IBM 
RS6000, Sun, HP, and SGI. No special hardware is 
needed. Additional ports of Envision to other 
UNIX platforms are in progress. 

Data sets intended to be used in Envision are 
stored as rectangular grids of values in n-dimen- 
sions. Envision currently works with the NetCDF 
format [Rew and Davis, 1990], developed by 
Unidata at the University Corporation for Atmo- 
spheric Research (UCAR) and HDF [Brown et al., 

1 993] , a format developed by National Center for 


Supercomputing Applications (NCSA) at Univer- 
sity of Illinois. HDF is the format recently adopted 
by the ESDIS project as the baseline standard for 
EOSDIS data product generation, archive, ingest, 
and distribution. 

2.2 Major Components 

Envision’ s key features are a metadata browser 
and editor, a data management system, and a set of 
links to visualization and analysis tools. Figure 1 
shows a schematic of the system, which is modular 
in design. Envision’ s principal components are the 
Envision Data Manager (EDM) and Envision User 
Interface (EUI), which run as separate processes in 
communication with each other. At the center of 
Envision is EDM, which acts as a server, to which 
any number of client processes, including EUI, may 
connect. Visualization and analysis tools are 
connected as clients, and EDM exchanges messages 
and data with them. Visualizations are performed 
using the public domain Xlmage and Collage 
programs from NCSA and the commercial IDL 
package from Research Systems, Inc., of Boulder, 
CO. The modular design of Envision provides 
extensibility, because other tools may be added to 
the Envision environment. EUI is an intuitive 
point-and-click graphical environment and has 
flexible customization features to view data sets. 

3. DATA MANAGEMENT 

Data management is one of the key purposes of 
Envision. EDM provides mechanisms to manage 
data variables, dimensions, and attributes in a group 
of related data files. 

3. 1 Managing Multiple-File Data Sets 

Data used in a large project are invariably 
contained in a set of data files. In practice, this 
organization of data can be unwieldy to manage. 
One way a data set might be stored is on a CD- 
ROM, with data files stored in a hierarchy of 
directories representing years, months, and days. 
The data in these files are conceptually a single 
entity, but have been divided for convenience of 
storage or because of its collection over a period of 
time. In a scenario like this, problems would arise 
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Figure 1. Envision schematic. Black arrows indicate passing of messages and metadata between separate processes. The green 
arrow indicates reading and writing of data values and metadata to and from files. Red arrows indicate direct transmission of 
data values by interprocess communication. 
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if a user wished to view a subset of the whole data 
set when the values wanted were stored in a number 
of these files. For example, a user might wish to 
see a display where time is an axis. 

Management of variable and dimension objects 
that span multiple files is done in Envision by 
matching up the objects that correspond in each of 
the files. Envision organizes large data sets into 
projects and the metadata, or data about the data, is 
saved in project files. The operation of matching 
up and grouping the same variables and dimensions 
in different files is referred to as merging. This 
operation creates new entities, called logical 
objects. Logical objects are simply a way to group 
together regular dimensions and variables that span 
multiple files, so they can be used as if they were 


all in a single file. Envision also handles situations 
where dimensions may be implied by the division 
of data into files. A dimension, quite often time, 
may not be explicitly defined, but can be assumed 
when each file in a data set represents a single step. 
In Envision, this type of dimension may be created 
and is called a virtual dimension. Once a virtual 
dimension is defined, it can be used exactly like any 
other dimension. T ypically , these are created for 
each data file and then merged together to make a 
logical dimension. 

A simple example will serve to illustrate how 
the merging of dimensions and variables into 
logical objects works. Figure 2a shows a set of 
three data files, each containing one variable, 
TEMPERATURE, and the two dimensions it uses, 
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Latitude and Longitude. In each file, the Latitude 
and Longitude dimensions have coordinate vari- 
ables with the same set of values as shown. There 
is no time dimension specified, but each file repre- 
sents a single month time step. The variable, 
TEMPERATURE, contains 7 x 13 = 91 values in 
each of the data files. Figure 2b shows the same 
three files after a virtual dimension. Month, has 
been created for each and then added to the vari- 
able, TEMPERATURE. These changes are made 
only in the Envision project, and the data files are 
not actually modified. Note that TEMPERATURE 
has 7x13x1=91 values in each file, the same as 
before. Figure 2c shows the result of merging all 
the corresponding variables and dimensions into 
logical objects. The logical dimension. Months, is 
a composite of the virtual dimension, Month, from 
each file. The logical variable, TEMPERATURE, 
contains 7 x 13x3 = 273 data values. 

The advantage in using logical variables and 
dimensions is that once they are created, the user no 
longer has to know which files are needed to 
retrieve a subset of the data values for that object. 
The complexity of working with multiple files is 
reduced so that the user can think of terms of a 
project rather than about individual files. Using 
virtual dimensions gives the user the ability to 
increase the dimensionality of variables when 
dimensions are implied by the division of data into 
files. 

3.2 Other Data Management Functions 

EDM performs a large number of tasks within 
Envision. In addition to handling variables and 
dimension objects spanning multiple files, it 
manages the metadata associated with these objects. 
EDM also controls the flow of data values between 
the data files and visualization and analysis mod- 
ules connected to it. 

4. USER INTERFACE 

EUI is the principal tool that a scientist uses to 
work with a data set in an Envision project. EUI is 
intended to be an easy-to-use, intuitive way to 
browse both metadata and data. The metadata 
describes data values and includes information such 
as collection or generation of the data and its 


subsequent manipulation. Examples of simple 
metadata are variable name and units. More 
complex metadata might include a list of processing 
steps used to generate a particular set of data 
values. 

4.1 User Interface Design 

EUI is a client process that connects to EDM. 
The interface is a point-and-click environment 
written in X/Motif. The main part of the user 
interface is a table display of dimensions and 
variables, as shown in Figure 3. At the top of the 
window is the menu bar, which has a series of pull- 
down menus for working with projects, data files, 
the appearance of the table, metadata access, and 
visualization. In the table display, dimensions 
(independent variables) are represented as columns, 
and variables (dependent variables) are shown as 
rows. Raised buttons indicate which dimensions 
are used by which variables. Where defined, the 
unit attribute for each dimension and variable in the 
table is displayed along with its name. 

Dimensions Menubar 



Variable not defined Variable defines 
Variables in this dimension in this dimension 

Figure 3. Envision user interface. 

The table display is also customizable by the 
user through the creation of different views of the 
project. In Envision, a view is a subset of the 
variables in a project. Within a view, the order of 
rows and columns may be changed, and variables 
not being used may be hidden. In this way. a user 
can reduce the size of the table display and can 
efficiently browse through a large project by 
looking only at the variables of interest. Also, a set 
of views in a project can be created for different 
purposes. For example, if several people are 
working with the same project, each may wish to 
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see the table with different contents and 

arrangements. 

4.2 Metadata Browsing and Editing 

One of the key uses of EUI is to get access to 
the metadata associated with a project. The usual 
way to look at metadata in the past was to dump the 
contents of a data file using a utility such as 
ncdump. Envision makes metadata browsing more 
efficient through interactivity. The table display 
itself shows the overall structure of metadata for a 
project, a compilation of a set of data files. EUI 
also has extensive reporting features, which give 
pop up windows containing variable, dimension, 
and global attributes for the entire project or 
individual data files. Several levels of information 
detail can be switched among windows using a push 
button approach. 

Another important feature of EUI is metadata 
editing. Metadata may be deleted, changed, or 
added to using pull-down menu options. There are 
a variety of applications of these capabilities. One 
use is the option to rename variables. For example, 
a variable named “T” could be changed to Tem- 
perature. If there were no units defined for a 
dimension or variable, these could be added. Also, 
if the metadata contained errors, these could be 
quickly corrected using EUI. 

As was mentioned previously, a data set may be 
on a read-only medium, such as a CD-ROM. In a 
situation like this, it is clearly not possible to modify 
anything in the data files. Because Envision stores 
metadata in a project file, metadata on a read-only 
medium can still be modified or added to. Even 
when users are capable of modifying the data files, 
they may prefer to avoid the overhead of rewriting 
the files. A disadvantage in duplicating metadata in 
Envision project files is that it uses more disk space. 
There is also a risk that someone could change the 
data files outside Envision without updating the 
project files. To combat this possibility, Envision 
provides the option to write metadata changes back 
to the data files. A user also can create new data 
files from subsets of variables and their dimension 
ranges in a project. From a conceptual viewpoint, it 
is not critical where the metadata are stored, as long 
as they can be conveniently accessed. 


5. VISUALIZATION 

Envision does not perform any visualization 
itself, but acts as a data server for visualization and 
analysis tools. 

5 . 1 How Envision Does Visualization 

Creating a display in Envision involves a few 
steps. First, a variable is identified in EUI for 
further analysis. The dimensions are then selected 
for display axes by clicking on the raised buttons 
intersecting the variable row. For example, for a 
two-dimensional visualization, X and Y axes would 
be chosen. A display program, such as Collage or 
IDL, is chosen from a pull-down menu. At this 
point, another window interface pops up. This 
interface allows the user to restrict the ranges of the 
dimensions and set options for the visualization 
display. When the user requests a display, the data 
values are fetched from the data files and assembled 
in the proper order by EDM, then sent directly to 
the visualization program. Other ranges or options 
can be set again and new displays created, without 
going back to EUI. 

5.2 NCSA Collage andXImage 

Two public domain visualization programs used 
with Envision are Collage and Xlmage, which were 
written by National Center for Supercomputing 
Applications (NCSA). Both are raster image 
viewers that can display single frames or anima- 
tions. Xlmage is several years old and has been 
largely replaced by Collage in the last year or so. 
Figure 4 shows an example of visualization options 
in Collage using TOMS ozone data. Figure 4a is 
the Collage options interface, which is written in XI 
Motif specifically for Envision. The other windows 
in the figure are from Collage. They illustrate a 
variety of available tools. 

Another important feature of Collage is that it is 
a collaborative tool. This means that all the win- 
dows produced in the program can be simulta- 
neously viewed on different networked worksta- 
tions. This gives distant users or those just down 
the hall the ability to analyze and browse the same 
data sets together. 
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5.3 IDL 

Another program with which Envision works is 
IDL. which is an interactive environment used to 
perform calculations on and visualizations of 
scientific data. IDL is used in Envision to create 
several visualization modules that draw one- 
dimensional and two-dimensional plots. Figure 5 
show s an example of a two-dimensional contour 
display. The interface in Figure 5a is similar in 
function to the Collage options interface in Figure 
4a. but is written entirely by IDL widget functions. 
The two IDL programs serve as examples to IDL 
users of how they can use EDM and EUI together 
with IDL functions to create an enhanced working 
environment. 
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6. STATUS AND PLANS 

6.1 Getting Envision 

Envision is a public domain software package and 
is available by anonymous FTP. The primary site is 
vista.atmos.uiuc.edu in the directory pub/envision, and 
the secondary site is csrp.tamu.edu in the directory 
pub/envision. Available at both locations are software 
binaries and source, documentation, and sample data 
sets. Instructions for obtaining NetCDF, HDF, 
Collage, Xlmage, and IDL are also provided there. 

For additional information, contact Keith Searight at 
k-searight@ uiuc.edu. 



— 

(a) 



(b) 


Figure 5 . IDL contour plot — (a) interface built with IDL widgets; (b) two-dimensional contour plot. 
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6.2 Current Work 

The first Envision beta version was released in 
June 1993 for IBM RS-6000, Sun, HP, and SGI 
workstations. Work under way includes adding 
new IDL modules for three-dimensional surfaces 
and geographic displays with continent outlines 
using different map projections, as well as a port to 
the DEC Alpha using OSF/1 . Another important 
feature currently being developed is a generic 
library for sending and receiving messages with 
EDM. This will allow users to easily combine their 
own visualization and analysis modules with 
Envision. Improved ways to automatically merge 
dimensions and variables are also under 
investigation. 
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Space science information systems involve accessing 
vast data bases. There is a need for an automatic 
process by which properties of the whole data set can 
be assimilated and presented to the user. Where data 
are in the form of spectrograms, phenomena can be 
detected by pattern recognition techniques. Presented 
are the first results obtained by applying unsupervised 
Artificial Neural Networks (ANNs) to the classification 
of magnetospheric wave spectra. The networks used 
here were a simple unsupervised Hamming network 
run on a PC and a more sophisticated CALM network 
run on a Sparc workstation. The ANNs were compared 
in their geophysical data recognition performance. 
CALM networks offer such qualities as fast learning, 
superiority in generalizing, the ability to continuously 
adapt to changes in the pattern set, and the possibility 
to modularize the network to allow the inter-relation 
between phenomena and data sets. This work is the 
first step toward an information system interface being 
developed at Sussex, the Whole Information System 
Expert (WISE). Phenomena in the data are 
automatically identified and provided to the user in the 
form of a data occurrence morphology, the Whole 
Information System Data Occurrence Morphology 
(WISDOM), along with relationships to other 
parameters and phenomena. 


1. INTRODUCTION 

Automatic classification of geophysical data is 
needed because of the vast quantities of data being 
generated and because of the nature of the data — 
often multidimensional and interrelated to other 
data sets. There are several existing and developing 
space database access tools, such as the European 
Space Information System (Query & Correlation 
Environment) and the Space Physics Query Lan- 
guage for use with data that, in general, have 
already been preprocessed into simple two-dimen- 
sional form, or derived parameters . 

However, much of the data generated by 
modern instruments are in three-dimensional form 
(or higher dimensions), such as wave frequency 
spectrograms and particle energy spectra against 
time. Phenomena are often difficult to extract by 
simple algorithms to give derived parameters, but 
they can generally be extracted visually by trained 
data analysts. For example, multiple simultaneous 
wave emissions are difficult to separate, as are 
weak signals close to the background level, such as 
the continuum emissions shown in the example 
used below. 

Data analysis person-hours are usually limited 
by strategic reasons so that only a limited part of 
any given data set is ever analyzed, and with little 
by way of global morphologies produced. Fortu- 
nately, the recognition of phenomena in three- 
dimensional color spectrograms is similar to the 
traditional problem of image recognition for which 
ANNs have been increasingly used in recent years. 
ANNs open up the possibility of automatically 
analyzing the full data set. 

Presented here are the first results obtained by 
applying two types of ANNs to the same geophysi- 
cal data example. Principles of feature identifica- 
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tion and automatic classification are demonstrated, 
paving the way for the whole data set to be 
searched to provide a complete feature morphology. 
Scientific phenomena are identified as interrelated 
feature phenomena of different data sets. 

2. GEOPHYSICAL DATA EXAMPLE USED 

The data set chosen is that of natural wave 
emissions measured in the vicinity of the Earth by 
the ESA satellite GEOS- 1 wave experiment consor- 
tium. S-300. The particular data example. 

Figure 1 a, was obtained during an inbound part of 
the orbit of GEOS- 1 where the spacecraft passes 
through a continuum emission generation region at 
04:00 UT on day 63, 1977. The data plotted are 
from a narrow-band wave filter that was stepped 
from 0 to 77kHz in 256 steps of 300Hz, taking 22s 
per frequency sweep. Natural electrostatic waves 
increase in frequency during the plot, reflecting the 
increases in local plasma frequency and local 
electron gyrofrequency as the spacecraft moves 
closer to Earth into regions of higher plasma 
density and stronger magnetic field. 
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At 04:00 UT, the electrostatic wave intensity 
becomes so strong that there is some conversion to 
weak electromagnetic continuum waves, or radio 
waves, that propagate away from this region. Indeed, 
these electromagnetic continuum waves are seen 
well before the generation region is reached as a 
well-defined chevron pattern in the frequency-time 
spectra. This pattern results from the fact that these 
radio waves are propagating directly away from the 
source region and are modulated by the satellite- 
receiving antenna spin. This modulation beats 
against the frequency sweeping to give the observed 
chevron pattern. Also present in the data at high 
frequency is an interference line generated by the 
telemetry and voltage converter clocks with further 
continuum emissions superimposed, while at the 
lowest frequencies there are strong electromagnetic 
emissions present throughout this data example. 

3. SIMPLE UNSUPERVISED HAMMING 

NETWORK CLASSIFICATION 

The first network used is a simple unsupervised 
Hamming network similar to the supervised Ham- 
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Figure I. Classification — (a) GEOS wave data classification example; (b) simple unsupervised Hamming network classification. 
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ming network flown within the SPREE instrument 
on STS-46, but modified for self-learning and 
operation on a PC. A detailed description of the 
original ANN is given in Gough (1993). A brief 
description of the modified ANN and the method of 
application are presented here. 

Data preprocessing consisted of a simple con- 
version of the multibit spectra into a 1-bit image by 
comparison of individual data points with the local 
average intensity around that point on the frequency- 
time spectra. Preprocessed data in this example are 
shown later in Figure 4. The Hamming ANN con- 
sists of 16-RAM neurons each with a 10-bit-wide 
address corresponding to 1,024 address locations 
(each of 16 bits) per neuron. The total of 160 
address bits were mapped with a fixed random 
connection to a box that was scanned across the 
preprocessed frequency-time spectrogram. All 
neuron RAM contents were initialized to zero, 
before learning. For each position of the box on the 
preprocessed data image, the image bits then de- 
fined, via the fixed mapping, individual locations 
within the 16 neurons. The contents of these 16 
locations were read out. If the contents of a particu- 
lar RAM were zero then other addresses were 
searched out up to a limiting Hamming distance 
from that address to find the closest address with 
non-zero contents. During learning, RAM data 
values were compared and a score generated for 
each, depending on how many of the neurons had 
the same value. At this stage, a value is entered into 
all RAMs at the locations addressed by the data 
covered by the box. If the score was less than 4, a 
previously unused negative number is the value 
stored. If the score was greater or equal to 4, and the 
value of the RAM with the highest score was 
negative, then a new, previously unused positive 
number is the value entered. If the score was be- 
tween 3 and 10, and the value of the RAM with the 
highest score was already positive and non-zero, 
then that positive number is the value entered into 
all RAMs. The system described here has some 
similarities to the Sparse Distributed Memory of 
Kanerva (1986) and to the n-tuple RAM neurons of 
Aleksander and Stonham (1979). 

In this way, features in the data initially enter 
the RAMs as negative numbers, but once these 
features recur, they are transferred to a recognized 
feature class with a positive number. Although the 


negative values can grow to large sizes with recy- 
cling through the negative number range, the data 
example shown below only generated positive 
numbers up to the value 4 (four classes identified). 
Preprocessing, data scanning, and learning are fully 
automatic in this scheme. Once learning has fin- 
ished unseen data can be classified by again scan- 
ning the box, but this time only reading RAM 
contents and only choosing the positive RAM value 
with the highest score. Although searching through 
RAM addresses within the limiting Hamming 
distance is achieved quickly, it can be eliminated by 
annealing the RAMs. Annealing consists of replac- 
ing all zero and negative values by the closest 
positive value (nearest in address Hamming dis- 
tance). All of the major wave features present in 
Figure la were automatically separated by the 
unsupervised Hamming ANN into four different 
feature phenomena, denoted by color in Figure lb. 
Retrospectively, an expert data analyst can easily 
name these four classes of feature phenomena: 
red — electrostatic cyclotron harmonic waves 
governed by the local plasma density and local 
magnetic field strength; blue — low-frequency 
electromagnetic emissions; yellow — electromag- 
netic continuum waves propagating away from the 
generation region; and green — local interference 
line from voltage converters/telemetry switching in 
the spacecraft. All the salient phenomena have been 
classified by this network, including separation of 
the weak higher frequency continuum when it is 
superimposed on the interference line (e.g., around 
03:40 UT). 

4. UNSUPERVISED CALM 

CLASSIFICATION 

CALM was developed to address problems in 
ANNs, such as lack of stability, lack of speed, 
inability to both discriminate between and general- 
ize over feature vectors (hereafter, vectors), and 
catastrophic interference [Murre, 1992], The 
module is related to Grossberg’s ART neural 
networks [Carpenter and Grossberg, 1987]. A 
CALM module can be described as a clustering 
mechanism that is able to discover statistical 
regularities in a stream of vectors. The module acts 
like a negative filter, subtracting known vectors 
from such a stream, so that novel vectors are 
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learned more intensively than the ones already 

known. 

CALM consists of a layer of input nodes and a 
layer of output nodes connected to each other in a 
special way. During the learning phase, CALM 
categorizes every presented vector into a single 
cluster, represented by an output node. The number 
of clusters CALM can use for categorizing is 
limited by the amount of output nodes given. 

To increase the stability, learning can vary from 
elaboration learning (about 200 learning cycles for 
ANN conversion) to activation learning (about 50 
cycles). A novel vector causes elaboration learning 
with the formation of new associations, whereas a 
known vector just strengthens the existing associa- 
tions. A high learning speed is achieved by using 
arousal effects, node competition, and noise within 
a module. The last ensures that convergence is 
reached with every vector, as it resolves competi- 
tion deadlocks in the module. The combination of 
single modules with a hierarchical structured ANN 
eliminates the effect of catastrophic interference if 
none of the module’ s capacity is exceeded. This 
means that well-learned vectors are not replaced 
when new vectors are learned and the old vectors 
are not presented again. 

The last section has shown how a crude classifica- 
tion can be achieved with simple preprocessing effort 
and a simple ANN. Current work concentrates on 
improving the preprocessing techniques to obtain 
abstract vectors for a classification with the more 
sophisticated CALM. The following example will 
show how abstract vectors with intensity, smallest 
frequency, orientation, and circularity components are 
gained from the example data set. The categorization 
result for a module with 7 input and 10 output nodes, 
to accommodate 7 vector components and a maximum 
of 1 0 clusters, will then be presented. 

Preprocessing the spectrogram to the features 
desired is a sequential process. An optional smooth- 
ing of the spectrogram by low-pass filtering pre- 
cedes a two-step segmentation. First, local back- 
ground dependent filtering, such as in the last 
section, yields a black-and-white picture where 
black areas denote local maxima. These areas 
represent features but cannot be used solely to 
extract all vector components needed. Second, 
sobel edge detection and conditional bit-wise 
copying of pixels from the original spectrogram to 
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the black-and-white picture result in a picture 
containing all features surrounded with a black 
boundary against a white background, shown in 
Figure 2a. The feature areas are then identified by a 
simple search algorithm, which allows a flood fill- 
related algorithm to find the pixels within the 
boundary and the boundary coordinates for each 
area. While the algorithm fills an area, it evaluates a 
pixel intensity histogram and stores the boundary 
coordinates. Thus, the vector component’ s inten- 
sity, smallest frequency, orientation, and circularity 
can easily be computed and normalized for all 
features to yield a set of vectors. Here, three inten- 
sity components were gained from the intensity 
histogram and the smallest frequency component 
from the minimum boundary Y -coordinate. The 
feature angle from 0 to 90 degrees to the horizontal 
was used for two orientation components, and the 
circularity component was the ratio of the number 
of pixels inside the feature area to the square of the 
number of boundary pixels. 

Unfortunately, the vector set cannot be pre- 
sented to CALM right away. The components need 
to be weighted (multiplied by a parameter) to 
achieve an optimal classification result. Suitable 
parameters are found by using a simple genetic 
algorithm [Goldberg, 1989] with a human classifi- 
cation as reference. A genetic algorithm is a mecha- 
nism that uses random choice as a tool to guide a 
highly exploitative search through a coding of a 
parameter space. With such optimized vectors, an 
untrained CALM found clusters as shown in Figure 
2c. Different colors represent different clusters and 
hence phenomena classes, as for section 3. The 
result is quite similar to the ideal classification of a 
data analyst given in Figure 2b. 

As the learned information is contained in the 
network’s learning weights, it is possible to store 
and restore these (pretrained network). This permits 
a quick scan through further data sets with or 
without additional elaboration learning. Both 
identify new, previously unknown features, where 
the former classifies all features and learns new 
features (continuous learning) and the latter quickly 
classifies all features (within 3 seconds for this data 
set). Apart from defining what kinds of vector 
components to use and an initial, genetic algorithm- 
supported, component parameter setting, the above 
processes are fully automatic. 
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Figure 2. CALM example — (a) preprocessed spectrogram; (b) data analyst classification; (c) ANN classification. The data analyst 
has classified the features from the preprocessed spectrogram into four different classes with the colors used in Figure I(b ). Black 
features denote unclassified features. The ANN classification shows how four known and five new classes (new colors ) were found 
by CALM. Note that different color scales were applied in Figure 1(a) and in the preprocessed spectrogram here. 


There are two false classifications in the red 
class. The small blob has been mistaken as a yellow 
and the thick line as a blue member from similar 
intensity distributions, angles, and circularities. In 
these cases, the Euclidean distance to members of 
the false class was smaller than to members of the 
correct class or too small to meet the discrimination 
threshold (see below). CALM has separated the 
yellow class into yellow and turquoise subclasses. 
Both subclasses differ slightly in their intensity 
distribution. Some of the yellow members were 
classified as red members because of inadequate 
segmentation, which resulted in false angle 
components. 

A straightforward definition of classification 
accuracy, where a human classification is taken as a 
reference, shown in Figure 2b, and the percentage 
of correctly categorized features for each class is 
computed, shows that 62 percent of the yellow, 100 
percent of the blue, 84 percent of the red, and 1 00 
percent of the green class features were correctly 
classified. The overall accuracy is 87 percent. 

5. COMPARISON OF ANNs USED 

The relatively crude Hamming network used 
first has the advantage of simplicity and reliable 


effective classification. It is quite fast, taking 1 
minute on a 33MHz 80486 to learn the data ex- 
ample of Figure 1 , and has low memory require- 
ments, with the whole PC demonstrator program 
taking less than 1 Mbyte. However, with the 1 -bit 
preprocessed image, it is only as good as the 
preprocessing algorithm used, but an effective 
algorithm can be found for most space plasma 
physics data types. The only user input is to select 
the size of the scanning box and the algorithm for 
pre-processing. In both cases, these selections, once 
optimized by trial, would be valid for all data of 
that data type in the data base. Major disadvantages 
of this simple ANN are lack of scaling to data 
feature size in the image and lack of sensitivity to 
intensity differences between feature types. 

The developed software package for prepro- 
cessing and CALM was run on a Solbourne S4000 
with a Sparc 1 processor. The preprocessing differs 
from the one in the last section. Here, each feature 
is represented by seven scaling independent, 
normalized, real-value components. Thus, the main 
feature properties are emphasized. There is no limit 
of dimensionality, so that one can use as many 
descriptors (components) as needed. The above 
result suggests the use of more and/or better de- 
scriptors to avoid misclassification. The discrimina- 
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tion threshold of 0.8 for the minimum Euclidean 
distance to categorize vectors in different clusters, 
along with a maximum component value of 1 .0, 
shows why this might be necessary. The prepro- 
cessing can be as fast as the machine and an opti- 
mized program code allows; here, it was around 1.5 
minutes for the example spectrogram. So far, little 
effort has been applied to code optimization. 
Categorizing of the vector set, which contained 77 
vectors, took 1 4 seconds. A smaller number of 
output nodes than clusters causes the network to 
generalize over the input vectors. This may be an 
advantage if a coarse classification is required or a 
disadvantage if fine classification is needed. In this 
example, the number of output nodes was larger 
than required, which allowed the categorization of 
new features. As CALM modules can be combined 
to increasingly complex hierarchical ANNs, it is 
possible to find more abstract classes within interre- 
lated phenomena. If the classes for a first layer of 
modules were letters (features), a second layer 
could combine them to words (feature phenomena) 
and a third layer to sentences (scientific phenom- 


ena). It might be necessary to tune the dynamics of 
individual modules in each layer to yield an optimal 
result as, for example, the generalization behavior 
also depends on the module parameters. It has been 
shown that genetic algorithms can help here as well 
[Murre, 1992], 

6. PROPOSED SYSTEM: WHOLE 

INFORMATION SYSTEM EXPERT (WISE) 

Repeated application of ANNs to data bases can 
provide a degree of automatic data analysis. Work 
at the University of Sussex is aiming toward a 
system known as WISE [Gough, 1992]. In this 
system, illustrated in Figure 3, ANNs are applied to 
data sets in a series of stages: 

Stage 0: The user (expert) optimizes the 

preprocessing needed for any given 
data set before the use of ANN. In 
genera], this is conversion of data into 
image form without loss of essential 
features. Feature vectors are then 
generated. 
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Stage 1 : Unsupervised ANN is applied to the 
feature vectors to classify features 
present in the image, similar to the 
traditional use of ANN in image/ 
pattern recognition. A local data base 
stores ANN weights, and thus features 
are recognized. 

Stage 2: The patterns of occurrence of features 
in the data set are themselves classi- 
fied by ANNs, statistics packages, and / 
or logic programming in a similar way, 
and again stored in the local data base 
as feature phenomena. 

Stage 3 : Similar methods as in stage 2 are used 
to find interrelationships between 
feature phenomena in the same or 
different data sets to identify scientific 
phenomena. The final results are then 
displayed, for example, like GEOS 
morphologies (see section 8). 

WISE would include toolkits for signal and 
image processing, neural networks, and statistics 
packages to achieve the above stages. The main 
product is the Whole Information System Data 
Occurrence Morphology (or WISDOM). An ex- 
pected scenario of WISE use is as follows. The 
expert familiar with each data set, probably the 
instrument PI or Col, optimizes preprocessing and 
runs WISE unsupervised on the whole data set. This 
expert labels features found and related feature 
groups, or phenomena, with the names known in 
that community of researchers. It is expected that 
such a system could reveal new phenomena or 
relationships at this stage. Later, a nonexpert user 
could apply the trained WISE system to identify 
any feature in the data set and instantly be given its 
morphology, or importance, along with all related 
features. In this way, space-acquired data bases can 
be better utilized by nonexperts and, being opened 
up to a wider user community, would become more 
fully utilized. At present, it is often estimated that 
less than 1 0 percent of all space plasma physics 
data has been accessed. 

7. INITIAL WISE FRONT-END PC 

DEMONSTRATOR 

A prototype front-end for WISE was written 
incorporating the simple unsupervised Hamming 


network running under Windows on a PC and 
demonstrated at AGU in the spring of 1 993. It can 
be seen from the main screen in Figure 4 that the 
preprocessing and ANN data classification can be 
fully monitored as processing occurs, with learning 
statistics keeping the user updated on patterns that 
are learned. At any time, the mouse can be used to 
test the response at a given location in the data. It is 
also possible to manually train the network by way 
of chosen examples (supervisediearning). In this 
case, it was found that considerable classification 
was possible after showing only one example of 
each of the four main data classes present in the 
data of Figure 1 . Other options include the selection 
of further windows to provide displays of the 
typical inputs corresponding to a given class, 
various classification statistics summaries, and 
displays of information distribution within the 
neurons with the possibility to anneal or trim the 
network for enhanced performance and minimized 
class overlap. 

8. TYPICAL WISDOM PRODUCTS 

In this concluding section, it is shown how such 
a system can be used to give phenomena morphol- 
ogy across the whole data set, and hence, an under- 
standing of the relative importance of any given 
phenomenon. Once an unsupervised ANN has 
identified a series of phenomena classes in a data 
set, it is a simple matter to then process the whole 
data base to generate morphologies of phenomena 
occurrence across the data base. 

Figure 5 shows an example of possible use of 
the future WISE system to understand the morphol- 
ogy of GEOS electrostatic emissions near the 
plasma frequency. Note that in these plots emi s- 
sions have been extracted by a dedicated numerical 
algorithm [Gough et al., 1980], but, from the above 
discussion, ANN would perform with equal satis- 
faction. In the examples shown in Figures 5a and 
5b, 430 days of GEOS II wave data have been 
analyzed. Data are plotted in polar plots against 
magnetic local time (midday at top, dawn to right, 
midnight at bottom, and dusk to left). 

Plots such as these immediately show the data 
analyst where wave emissions are strongest in 
geomagnetic coordinates, how the power varies 
with geomagnetic activity, and how pi asma fre- 
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Figure 4. PC screen showing use of WISE front-end demonstrator. On the left are shown top to bottom: original wave spectrogram. 
I -bit preprocessed data, and the results from applying ANN. Clicking the mouse on any position (on any of these three windows) 
provides in the top right window a closeup view of the selected area, with neuron scoring and subsequent feature class 
identification. The lower right window provides a histogram of occurrence of the feature classes in the data. Pull-down menus 
provide full control over data access, preprocessing, and neural network configuration. 



Figure 5. Electrostatic wave morphology in the GEOS 11 data set — (a) GEOS II wave power plotted radially against LT; (b) GEOS 
II wave frequency plotted radially against LT. 
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quency and hence cold electron density vary with local 
time and geomagnetic activity. An equivalent study by 
a human data analyst would require many intensive 
person-years of effort because of the need to accurately 
identify wave phenomena in such a study. 
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The analysis of space science data often requires re- 
searchers to work with many different types of data. For 
instance, correlative analysis can require data from 
multiple instruments on a single spacecraft, multiple 
spacecraft, and ground-based data. Typically, data from 
each source are available in a different format and have 
been written on a different type of computer, and so much 
effort must be spent to read the data and convert it to the 
computer and format that the researchers use in their 
analysis. The large and ever-growing amount of data 
and the large investment by the scientific community in 
software that require a specific data format make using 
standard data formats impractical. 

A format-independent approach to accessing and 
analyzing disparate data is key to being able to deliver 
data to a diverse community in a timely fashion. The 
system in use at the Planetary Plasma Interactions (PPI) 
node of the NASA Planetary Data System ( PDS ) is based 
on the object-oriented Distributed Inventory Tracking 
and Data Ordering Specification (DITDOS), which 
describes data inventories in a storage independent way. 

The specifications have been designed to make it possible 
to build DITDOS compliant inventories that can exist on 
portable media such as CD-ROMs. The portable media 
can be moved within a system, or from system to system, 
and still be used without modification. Several 
applications have been developed to work with DITDOS 
compliant data holdings. One is a windows-based client/ 
server application, which helps guide the user in the 
selection of data. A user can select a data base, then a 
data set, then a specific datafile, and then either order 
the data and receive it immediately if it is online or 
request that it be brought online if it is not. A user can 
also view data by any of the supported methods. DITDOS 
makes it possible to use already existing applications for 
data-specific actions, and this is done whenever possible. 
Another application is a stand-alone tool to assist in the 
extraction of data from portable media, such as CD- 
ROMs. In addition to the applications, there is a set of 
libraries that can facilitate building new DITDOS 
compliant applications. 


1. INTRODUCTION 

Space physics researchers often are confronted 
with data in a variety of formats. Many information 
system designers also are faced with this problem, 
in both the commercial and scientific communities. 
In the scientific community, there are a number of 
reasons why data obtained from different sources 
use different storage methods. In general, projects 
adopt a formal storage format, but these formats 
differ from project to project. In addition, the binary 
representation of the data can differ among projects. 
These differences occur because different proj ects 
use different computing environments and design 
their formats to address specific needs . For ol der 
missions, the computing environment in the scien- 
tific community may have changed so much that it 
is difficult to transform the original data into a 
usable form. 

There have been attempts in the past to develop 
universal formats or to impose a fixed computing 
environment. These efforts have encountered 
limited success, primarily because of the cost of 
imposing standards retroactively. Restoring, refor- 
matting, and re-archiving of a historical project’s 
data holdings can be a substantial fraction of the 
dollars available for research on the data. Some 
balance between making data readily available and 
easy to use and providing research support must be 
reached. The Planetary Plasma Interactions (PPI) 
node of the NASA Planetary Data System (PDS) 
has been striving to provide easy access to data 
from all the fields and particle instruments on 
planetary missions while preserving the investments 
made in software analysis and research tools by 
individual projects, as well as providing a pathway 
to new technologies and new applications. We are 
doing this by adopting a format-and-operating 
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system-independent approach to accessing and 

analyzing data. 

2. MODEL DATA SYSTEM 

In 1988, when we designed the first version of 
the PDS/PPI node software, we adopted a model for 
a distributed data system that consisted of multiple 
access points to (mostly) centralized data holdings. 
This model was adopted primarily to address slow 
user response times typical when systems with 
centralized access and data holdings were accessed 
remotely. In addition, all data were reformatted into 
a single, common format when it was placed within 
the system. This was effective, but we quickly 
learned that there were important issues this model 
did not address. First, because we used a standard 
data format that was different from the one used by 
the project, when members of the project team 
obtained the data, they could not read the data with 
their existing software. This was very inconvenient. 
To address this we needed to provide some transla- 
tion capability from the standard format to the 
specific format used by the project. Second, we 
found we were spending significant resources 
converting data from a specific format to the 
standard format. These resources included person- 
nel to perform the conversion, space to store the 
converted copies of the data, and the additional 
management overhead of tracking various versions 
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of the same data. Third, by maintaining a central 
data holding, we were not making the most effec- 
tive use of pre-existing resources. We have found 
that by maintaining data holdings within active 
research institutions, the quality of the data is 
increased because the data are being used in re- 
search and problems are corrected as they are 
found. Fourth, while we had an effective method 
for managing our data holdings, researchers still 
had to use different methods to manage the data 
received from us. In some cases, users wanted large 
quantities of data for statistical studies. Our existing 
system provided little support for such endeavors . 

Recently, we used the lessons learned from the 
first version of the PDS/PPI node software to 
design our second generation data system. It is a 
highly distributed data system with distributed data 
holdings and distributed data access points. In this 
system, we decided that the data holdings would 
remain in the data format the project or investiga- 
tion deemed most suitable. Because we must be 
able to handle data in any format, it was not pos- 
sible to include the data within a particular data 
base management system. We needed a standard- 
ized inventory scheme so that individual data files 
could be tracked. 

The overall model for a system designed to 
address these issues is depicted in Figure 1 . In the 
new system model, there are two kinds of servers, 
primary and secondary, and two kinds of clients. 


Model of Second Generation PDS/PPI Data System 



Secondary Servers 


Figure 1 . The data system model used in the design of the second generation PDS/PPI system. 
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primary and secondary. The primary server provides 
two functions. First, it provides standardized access 
to data inventories to facilitate locating the data 
holding of interest. In this function, the primary 
server delivers inventory information in a specific 
format and with a content described by the inventory 
contents specifications (to be defined later) to the 
requesting client. There are no restrictions on how 
the inventory information is physically stored. It can 
be stored in data base management systems or in 
alternative inventory management systems. The 
second function of the primary server is to provide a 
standard method for accessing data holdings. The 
primary server calls upon a secondary server (or 
server agent), which can read data in a specific 
format and deliver a stream of bytes to the primary 
server. The secondary servers are format-specific 
servers that deliver requested data to the primary 
server for transport to a requesting client. The 
primary server takes the stream of bytes produced by 
the secondary server, packetizes it, and delivers it to 
the requesting client. The movement of data from 
the secondary server to the primary server and from 
the primary server to the requesting client is accom- 
plished by using specifically designed protocols. 

Like the servers, there are both primary clients 
and secondary clients. A primary client interacts 


with a primary server by using the primary client/ 
server protocol. The role of the primary client is to 
provide a view of the inventories provided by the 
primary server and to enable the selection of a 
specific data holding. When a primary client is 
handling a data holding request, the data are deliv- 
ered as a packetized stream of by tes, which are 
preceded by information detailing the specific data 
format the stream represents. This data format 
information is used to select a particular secondary 
client (or client agent) to handle the data stream. 
The primary client’s function is limited to 
unpacketizing the data and delivering it to the 
secondary client. 

Like their secondary server counterparts, 
secondary clients operate on data of a specific 
format. Secondary clients are typically identified as 
either a writer or a viewer, depending on the 
destination of the data. A writer typically writes the 
data to a local disk, and a viewer typically displays 
the data on the screen. In some cases, a viewer may 
also serve as a writer — for example, when a viewer 
allows the user to examine or select subsets of the 
data before writing the data to disk. Figure 2 shows 
the information pathways between the primary 
client and server and between secondary clients and 
servers. 


Information Pathways between Primary and Secondary 
Clients and Servers 



Reader 

Secondary \ 
Server 


One or more 
files 


Information 


Figure 2. Information pathways between the primary client and server and secondary clients and servers. The arrows indicate the 
flow direction of information ( data and inventory). 
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The entire system must be portable and scalable 
[King et al., 1993b]. We wanted to be able to 
include with large data collections a self-contained 
data system (portability) with which a user could 
manage data and that is independent of a computer 
and operating system. In addition, we wanted to be 
able to migrate collections of data online and 
offline with a minimal impact on the entire system 
(scalability) [Kingetal., 1993a], 

3. DITDOS DETAILS 

We have defined a set of specifications called 
the Distributed Inventory Tracking and Data 
Ordering Specification (DITDOS), which detail the 
six types of metadata required to manage data using 
our approach. The inventory contents are client/ 
server protocol, secondary server protocol, primary 
server protocol, secondary client/server configura- 
tion, and inventory description. Each of these are 
discussed separately. 

3.1 In ventory Contents Specification 

The inventory contents specification defines the 
data returned by a primary server when an inven- 
tory request is made by a client. The two types of 
inventory information are data base and data set 
information. A data base description describes 
collections of data sets. A data set describes collec- 


tions of data holdings and includes information 
about access privileges and curator information. 
Each data holding is referred to as a member of a 
data set. Because membership in a data set is 
maintained through reference to a physical data 
holding, it is possible for a data holding to be a 
member of multiple data sets without reproducing 
the data. 

While the inventory information has a fixed 
format and structure and was designed to be used in 
a relational data base system, there is no require- 
ment that the inventory information actually be 
stored in a relational data base system. The only 
requirement is that a primary server deliver the 
inventory information in a specific format. Figure 3 
shows the contents of the various relational 
“tables,” called the inventory tables, and how the 
tables interrelate. The contents of the inventory 
tables detail which data sets are available and which 
members exist in each data set. Each set of inven- 
tory tables is referred to as a data base. There are 
four “tables” in a data base: ( 1 ) the “data set” table, 
which defines which data sets are in the data base; 
(2) the “member” table, which defines which data 
files are members of a specific data set; (3) the 
“curator” table, which provides information on who 
contributed the data set; and (4) the “privs” table, 
which describes privileges users have on data sets. 
The detailed structure of each inventory table 
follows. 



Figure 3. The required information in each inventory table and the relation of fields in each table. The lines show dependencies. 
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3.1.1 Data Set Table 

The data set table defines which data sets exist, 
the curator of each individual data set, and a 
description of the data set contents. The structure of 
the data set information is shown in Table 1 . 

3.1.2 Member Table 

The member table defines the members of 
each data set. It classifies the content of the data 
holding and identifies the storage format. It also 
contains a description of the contents of the data 
holding. The structure of the member table is shown 
in Table 2. 


3.1.3 Curator Table 

The curator table provides detailed information 
for each curator reference name. This includes the 
physical mailing address, telephone number, and E- 
mail address. The structure of the curator table is 
shown in Table 3. 

3.1 .4 Privs Table 

The privs table identifies which users from 
which host have specific privileges on a data set. 
These privileges apply to all possible operations 
that can be performed on all member data holdings. 
The structure of the privs table is shown in Table 4. 


Table I. Data Set Table. 


Field Name 

Size 

Type 

Description 

refname 

64 

AlphaNumeric 

The external reference name for the data set. 

curator 

32 

AlphaNumeric 

The reference name of the curator for the data set. 

desc 

512 AlphaNumeric 

A textual description of the contents of the data set. 

Table 2. Member Table. 

Field Name 

Size 

Type 

Description 

data set 

64 

AlphaNumeric 

The external reference name of the data set of which this file is a member. 

sysname 

256 

AlphaNumeric 

The system name (path) to the data holding to which this entry refers. 

type 

32 

AlphaNumeric 

The data content type of a member. This is typically a grouping, such as data, 
document, image, animation, etc. 

status 

64 

AlphaNumeric 

A description of the status of the member file. Words such as “online" and 
“offline” should appear in the status description. 

class 

32 

AlphaNumeric 

A phrase identifying the class of the member file. The class is the name of the 
storage format. 

instance 

64 

AlphaNumeric 

A phrase identifying the instance of the member file. The instance is equivalent 
to the version of a format. 

desc 

512 

AlphaNumeric 

A textual description of the contents of the member. 

Table 3. Curator Table. 

Field Name 

Size 

Type 

Description 

curator 

32 

AlphaNumeric 

The reference name of the curator for the data set. 

name 

64 

AlphaNumeric 

The full name of the curator. 

inst 

64 

AlphaNumeric 

Institution or affiliation name. 

address 

64 

AlphaNumeric 

The full mailing address — for example, street, city, state, zip, and country. 

phone 

32 

AlphaNumeric 

Telephone number for contacting the curator. 

e-mail 

128 AlphaNumeric 

Full E-mail address for contacting the curator. 


Table 4. Privs Table. 


Field Name 

Size 

Type 

Description 

refname 

64 

AlphaNumeric 

The external reference name for the data set for which this set of privileges applies. 

user 

32 

AlphaNumeric 

The name of the user granted this set of privileges.The entry may contain wild cards. 

host 

32 

AlphaNumeric 

The name of the host from which the user must originate for this set of privileges 
to be granted. The entry may contain wild cards. 

priv 

4 

Integer 

A bit map of privileges that are granted. The following fields are defined: 
bit 1: select; bit 2: insert; bit 3: update; and bit 4: delete. All other bit fields 
are undefined. 
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3.1.5 Notes on the Tables 

With the exception of the “sysname” field in 
the member table, all information in a data base is 
an abstraction — that is, it is something that is 
defined to help organize and manage the underlying 
data. For example, a data set name (refname in the 
data set table) can contain any sequence of charac- 
ters. It can be a serial number or some name derived 
by using a particular nomenclature. Unique aspects 
of the inventory data are the “class” and “instance” 
entries in the member table. This information is 
used to determine the specific format of the data 
file and in turn is used to determine which second- 
ary client or secondary server to call to manipulate 
the data. For example, if the class for a member was 
“UCLA/IGPP flat file” and the instance was 
“Version 2.0,” then the format of the member file 
would be the one developed at UCLA recently in 
which the member file consisted of two physical 
files, one a PDS compliant label file and the other 
the data file. 

Another unique aspect of the inventory tables 
involves the use of the “user” and “host” fields in the 
privs table. This table was designed to work easily 
with networks so the user and host fields support the 
use of wild cards. This allows privileges to be 
granted based on the user’s host, or the user name or 
some combination of user and host. For example, if 
the user field was (% is the wild card character) 


and host was “kingsun,” then any user from 
“kingsun” would be granted the associated privilege. 
Possible privileges that may be granted are: SE- 
LECT, INSERT, UPDATE, and DELETE. 

All information provided in the inventory is 
considered opaque (read-only) to the client and is 
used solely for selection purposes. Information 
maintained in the privs table is used internally by 
the server and typically should not be a concern of a 
client. It is expected that only special clients, 
referred to as administration tools, would actually 
alter the content of the inventory files. 

3.2 Primary Client/Server Protocol Specification 

The primary client/server protocol is a bi- 
directional protocol, which uses tagged packets. A 
packet consists of a tag accompanied by data. The 
tag associated with the packet is either a request or 
a response, and the data associated with the packet 
is either additional information needed to complete 
the request or the results of a request when the 
packet is a response. Possible responses from a 
server are shown in Table 5. 

A server may be configured by a client by 
sending a packet tagged at SET. The data portion of 
a SET packet must contain a string with a 
“key word=value” syntax. The current parameters 
that can be set within a server are shown in Table 6. 


Table 5. Primary Client/Server Protocol Response. 

Tag Description Data 


OK Indicates that the request has been received and will be processed. None. 

ERR Indicates that an error has occurred while attempting to act on 

the most recent request. 

DATA Signifies a packet that contains data, which is in response to the 

most recent request. 

OBJECT Signifies a packet that contains information about the subsequent 

data packets. An object packet must precede all data transfers. 

END Indicates the end of the data transfer. The request has been fulfilled. None 

No more data are to follow. 


Text containing a descriptive message 
about the cause of the error. 

The data requested. 

The record from the member table 
corresponding to the member. 



Table 6. Current Parameters. 

Parameter 

Description 

DATAJIYPE 

Defines the type of data of interest. Defining this parameter limits searches of 
the member table to members of the specified type. 

SEARCH_METHOD 

Defines the search method to use when locating records. Possible values are SERIAL 
and GLOBAL. A SERIAL search proceeds through table records sequentially. 

A GLOBAL search proceeds through tables by using a relational model, matching 
all possible occurrences. 

MEMBER_SYSNAME 

Defines the system name of the desired member file. This is required to eliminate 
ambiguities when there are multiple member files with the same data type. 

TIME_FORMAT 

Defines the time format that secondary servers use when returning time values. 
The value assigned to the variable is the “name” of the desired format. Currently, 
17 different formats are defined. See text for details. 
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Supported time formats are: 

• AMERDATE (1/19/93) 

• EURODATE (19.1.93) 

• ABBREAMER (Jan 19, 1993) 

• ABBREURO (19 Jan 1993) 

• LONGAMER (January 19, 1993) 

• LONGEURO(19 January 1993) 

• NUMERICAL (93.019) 

• DAYNUMBER(93 019) 

• JAP AND ATE (93.19.1) 

• NIPPOND ATE (93.1.19) 

• HIGHLOW (93 1 19) 

• 1 SEED ATE (93 19 1 19) 

• DFS_STYLE ( 1993-Jan- 1 9) 

• ABBRDFS_STYLE( 1993/0 1/1 9) 

• PDS_STYLE (199301 19T12:43:36.002Z 

• ISO (199301 19T124336Z) 

• BINARY (8536778 16.0002) 

Each date, with the exception of PDS_STYLE, 
ISO, and BINARY, are followed by a time of day in 
the format HH:MM.SS.mmm( 12:43:36.002). 

Possible requests from a client are shown in 
Table 7. 

The client/server protocol stipulates that when a 
client initiates a connection to a server, it must 
register with the server the name of the user running 
the client and the name of the originating host. The 
protocol also stipulates that only a client can initiate 
a request for information. A server must always 
acknowledge the receipt of all client requests. A 
server’ s acknowledgment can be either “OK” if the 
request was accepted and will be processed or 
“ERR” if an error has occurred and the request will 
not be processed. An additional requirement is that 
a data base must be opened before any queries can 
be submitted. 


3.3 Secondary Server Protocol Specification 

When a secondary server is called by the 
primary server, it is called with a single command 
line argument. That argument is the value of the 
“sysname” field in the member table for the re- 
quested member. In addition, two environment 
variables are set. These are: 

• DITDOS_TIME_FORM AT, which is set to 
a value defined by the client using the 
“SET” packet. 

• DITDOS_QUERY, which is set to an SQL 
query submitted with the QUERY packet. 

An application is not required to use any 
environment variables, which are defined so that an 
application may provide additional services beyond 
data delivery. 

A secondary server accesses the data file 
associated with the passed “sysname,” converts the 
data file into a data stream, and writes the resulting 
data stream to “standard out.” Because the interac- 
tion between the primary server and the secondary 
server is local (i.e., on the same machine), the 
definition of “standard out” is irrelevant. Because 
all access to data by a client must go through the 
primary server, which packetizes data in a standard- 
ized manner, all platform dependencies are trans- 
parent to the client. 

3.4 Secondary Client Protocol Specification 

The secondary client protocol is identical to the 
secondary server protocol with the following 
exception. A secondary client reads data from 
standard in and outputs data to some device (i.e., 
the screen, a disk file, or another process). The 


Table 7. Possible Client Requests. 


Tag 

Description 

Data 

REGISTER 

Signifies a packet which transfers user and host 
registration information. 

The user and host name for the client in the 
format user@host. 

QUERY 

Requests specific information. 

The query for information in SQL format. 

OPENDB 

Requests opening a data base. 

The name of the database. 

CLOSE_DB 

Requests closing the previously open data base. 

None. 

VERSION 

Requests the version number of the server. 

None. 

RESET 

Requests resetting the server to an initial state. 

None. 

DONE 

Instructs the server that the client has completed all 
transactions and is exiting. 

None. 

CANCEL 

Requests cancelling any currently running queries. 

None. 

SET 

Sets specific parameters in a server. 

Parameter setting instructions in the form 
“parameter=value.” 
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definition of “standard in” is platform specific, but 
is transparent to the server because all data must 
pass through the primary client on their way to the 

secondary client. 

3.5 Secondary Client/Server Configuration 
Specification 

Each primary client and server can launch a 
secondary client or server through a simple func- 
tion. This function takes the class and instance 
information, along with the type of client or server 
agent required. Possible choices for agents are 
readers, writers, viewers, and requesters. A reader is 
a secondary server agent, writers and viewers are 
secondary client agents, and a requester is a special 
type of agent that can be used to automate some 
independent action, such as bringing offline data 
online. In this case, a requester could send E-mail 
to a specific individual or could run a process that 
will retrieve the requested file. 

Agent categories are divided into specific types. 
For example, the user may have a graphic viewer 
and a textual viewer. The available agents for a 
particular class and instance are specified in a 
configuration file, which is external to the primary 
client and server. The format of entries in the 
configuration file is: 

class={classname} 
instance= { instance name } 

{ agent } . { type } = { application } 

All {agent}, {type} definitions are associated with 
the preceding class and instance definition. If an 
{ application } does not have a preceding path, then 
the application is expected to be located in the 
installation directory. An example of an agent 
configuration for the UCLA/IGPP flat file is as 
follows: 

class = UCLA/IGPP FLAT FILE 
instance = Version 1.0 

reader . native = ff convert 
writer . native = byte-write 
writer. ascii = f fwrite-ascii 
viewer. text = xffviewer 
viewer . graphic = xff graph 
request . of f line = /usr/ucb/mail 
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This configuration defines one reader, two writers, 
two viewers, and a way to request offline data. The 
reader is a “native” reader, which will read the flat 
file in its native format. The two writers are “na- 
tive” and “ascii.” The native writer will write a flat 
file in its native format, and the “ascii” writer will 
convert the native flat file format to an ASCII 
format. The viewers allow visualizing the data in 
either a graphic or textual format. 

A typical client application interacts with the 
agent configuration file to present choices to the 
user when specific actions are to be taken. 

3. 6 Inventory Descriptions Specification 

Because there are occasions when a user may 
want to transfer a description of a DITDOS inven- 
tory in a portable way, an inventory description 
language was defined. An inventory description is 
an ASCII file containing directives that list member 
files and link them to data sets. The basic syntax for 
the inventory description is: 

define {object} { object option... }, 

where {object} can be “data set” or “member” and 
{object option} is an {object} specific list of 
options. There also are commands for creating data 
bases and for setting the focus for subsequent 
define commands. 

3.7 Environment Variables 

Currently, only one execution time environment 
variable is defined. The name of the variable is 
“DITDOS” and must be assigned the path to the 
installation directory. Client applications use the 
environment variable “DITDOS_OPERATOR” to 
determine whom to send comments. 

4. CONCLUSIONS 

The DITDOS specifications provide a protocol 
for designing portable applications for accessing 
data inventories. The non-intrusive characteristic of 
systems based on DITDOS makes it easy to add 
distributed access to preexisting data systems. In 
addition, the scalability of DITDOS-based systems 
makes it possible to build self-contained data 
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collections that include DITDOS inventories. These 
self-contained data collections can be put on hard 
disk, CD-ROM, or other distributable media. In 
addition, a single method of access can be used 
to obtain data from online data holdings or from 
CD-ROMs. Various applications have been written 
based on the DITDOS specifications and are 
currently in use as the data access system for the 
PPI/PDS node. These include a server for invento- 
ries maintained in UCLA/I GPP flat files, an inven- 
tory description translator, an X-Windows client 
(which provides for data viewing and ordering), 
and command line extraction tools to accompany 
CD-ROMs. 
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C.T. RUSSELL, G.LE, J.G. LUHMANN, and 
B. LITTLEFIELD 

Institute of Geophysics and Planetary Physics, 
University of California, Los Angeles, CA 

The UCLA Space Physics Group has developed educa- 
tional software composed of a series of modules to assist 
students with understanding basic concepts of space 
plasmas and charged particle motion. Present modules 
cover planetary magnetospheres, charged particle mo- 
tion, cold plasma waves, collisionless shock waves, and 
solar wind. The software is designed around the prin- 
ciple that students can learn more by doing rather than 
by reading or listening. The programs provide a labora- 
tory-like environment in which the student can control, 
observe, and measure complex behavior. The interac- 
tive graphics environment allows the student to visualize 
the results of his or her experimentation and to try 
different parameters as desired. The current version of 
the software runs on UNIX-based operating systems in 
an X-Windows environment. It has been used in a 
classroom setting at both UCLA and the University of 
California at San Diego. 


1. INTRODUCTION 

Laboratory courses traditionally have been an 
essential adjunct to classroom instruction in the 
physical sciences. Many principles are abstract until 
the student can experience the effects of those 
principles in a hands-on experiment. Moreover, 
many students leam better by doing rather than by 
reading or listening. The laboratory course strongly 
reinforces the book learning. In space physics, it is 
difficult to construct a traditional laboratory course 
because the size and expense of the apparatus 
necessary to undertake properly scaled experiments 
would be prohibitive. Fortunately, computers 
provide us with a mechanism to supplement the 
lecture course in a meaningful way. To support the 
classroom instruction in space physics at UCLA, we 
have developed a series of software modules that 
illustrate various concepts related to space plasmas. 
At this writing, we know of no similar available 
software for space physics. These modules provide 
an experimental experience with physical processes 
that cannot be studied in the laboratory. They allow 
the student to compare theoretical and model results 
with “observations” obtained using these modules 
and to visualize complex physical processes. They 
reinforce the classroom instruction and develop 
physical intuition. 

Five modules are presently available. While 
designed as an adjunct to a lecture course in space 
physics, some of these modules would be useful in 
teaching more basic plasma physics concepts. The 
magnetospheres module takes the student through 
various magnetic field models of the Earth’ s 
magnetosphere and compares the dipole fields of 
each of the magnetized planets. The particle tracing 
module allows the student to follow the motion of 
ions and electrons of varying energy in magnetic 
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and electric fields of varying geometry. The cold 
plasma wave module allows the student to examine 
the behavior of electromagnetic waves in a cold 
plasma. The solar wind module illustrates the 
radial variation of the solar wind and interplanetary 
magnetic field, how the solar current sheet affects 
the structure of the interplanetary magnetic field, 
and how disturbances propagate through the inter- 
planetary medium. Finally, the collisionless shock 
module allows the student to calculate how the 
solar wind plasma and magnetic field vary across a 
collisionless shock as the parameters of the shock 
and the solar wind vary. These modules will 
continue to be enhanced and new modules added 
during the foreseeable future. In particular, we 
expect soon to develop a current system module 
that illustrates the effect of currents flowing in 
different parts of the Earth’ s magnetosphere. 

The development of software such as this is a 
complex and expensive undertaking. The programs 
described below were developed beginning in the 
summer of 199 1 with the support of the National 
Science Foundation and NASA under the Space 
Grant Universities Program. In the following 
sections, we describe the programming environment 
and then each of the existing modules in turn. 

2. PROGRAMMING ENVIRONMENT 

The Space Physics Education Software was 
developed to run on Sun Sparc workstations using 
C, X-Windows, and Openlook. The software can 
be run “remotely” at our site at UCLA and dis- 
played on any suitable local machine’ s display. 

The only requirement is that the local machine runs 
X-Windows. This includes Macintoshes, which run 
the Mac-X program . Local copies of the software 
can be made easily as well. Only two files need 
revision to create an executable on a new system. 
These two files are the Makefile and the X-Win- 
dows resource file , Please contact the lead author 
(ctrussell@igpp.ucla.edu) for information on using 
the code remotely or how to obtain a copy of the 
code. 

Portability was kept in mind during develop- 
ment, so the program can run on different UNIX 
platforms and X-Windows environments. Cur- 
rently, the software can be run on Sun workstations 
with either Sun’ s or GNU’ s C compiler for added 


portability. Plans are to add Motif support to the 
program soon and to port the program to other 
vendor platforms as needed. 

3. THE LEARNING MODULES 

3.1 Magnetospheres 

The magnetospheres module was designed to 
introduce students to some of the elementary 
properties of planetary magnetic fields. The first 
exercise introduces the dipole magnetosphere of 
each of the planets. The student can “fly through” 
the magnetic field using the mouse to take measure- 
ments at any chosen position. The next exercise 
allows the student to examine the properties of the 
“mirror-dipole ” magnetosphere of Chapman and 
Ferraro and to see in a simple analytic model how 
the magnetosphere is compressed by the solar wind. 
The next level of complexity is the spherical 
magnetosphere, in which the equatorial field near 
the boundary is tripled because of the highly curved 
spherical magnetopause surface. This can be 
compared in the next exercise to Tsyganenko ’s 
[1989a ] vacuum magnetosphere which is confined 
inside an elongated ellipse. With its realistically 
shaped dayside magnetopause, the field at the nose 
in this model is compressed by a factor of 2.44 over 
the undisturbed dipole field at this distance. The 
final exercise allows the student to probe 
Tsyganenko ’s [1 989b ] empirical magnetosphere, 
which is distorted relative to the vacuum magneto- 
sphere because of the implicit presence of internal 
plasma. Figure 1 shows a copy of the window 
displaying this last option, with a train of mouse- 
selected position points shown on a “flight” through 
the magnetotail. 

3.2 Particle Motion 

The particle motion module was designed to 
demonstrate the behavior of single charged particles 
moving in magnetic fields of geophysical interest. 
On entering the module, the user is presented with a 
menu of field geometries, including Uniform, 
Harris Sheet , Dipole, Mirror, Gradient, and Curva- 
ture magnetic geometries and an ExB option that 
includes a uniform electric field together with a 
uniform magnetic field. The Harris current sheet is 
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Figure 1. Display from the magnetospheres module of the space physics program. The empirical model ofTsyganenko is shown. 
The x’s are points at which the user has made measurements of the magnetic field using the cursor and the mouse. The results 
of the last measurement are given in the boxes on the left side of the display. 


a simple analytic current sheet of finite thickness 
that approximates Earth’ s plasma sheet [Harris, 
1962], The user can then choose the desired 
particle mass (and charge), including H+, He+, 

He ++ , 0+, H - ’ and “heavy” electrons. The particle 
can be started anywhere within the box showing the 
field geometry in three views by the manipulation 
of slider bars. Slider bars are similarly used to 
select the initial velocity vector components. The 
particle trajectory is then calculated by a standard 
finite difference algorithm for initial value prob- 
lems. The tracing is terminated by use of a “pause” 
button. The trajectory can then be erased with 
another button and restarted as desired, or new 
trajectories can be superposed on the old using 
different selectable colors. Figure 2 shows a 
display for the dipole field option. 

As the trajectory is calculated, the display in the 
upper right-hand corner shows the constantly 
updated particle energy, the accumulated time, the 
first and last pitch angles, and minimum and 
maximum positions and velocities. These allow the 
user to carry out quantitative “experiments,” such 
as verifying the conservation of energy and adia- 
batic invariants, determining how mass and charge 


affect drift motions, and so forth. In some options, 
parameters of the field model can be varied (e.g., 
the Harris Sheet thickness) so that the user can also 
obtain a feeling for how field strength, scale sizes 
of gradients and current sheets, and other modifica- 
tions can affect the particle motion. When the 
particle motion has stopped, the user can measure 
positions on the screen with the mouse-driven 
cursor. These measurements appear at the bottom 
of the screen. 

3.3 Solar Wind 

The solar wind module was created to commu- 
nicate concepts related to solar wind and interplan- 
etary magnetic field behavior. On entering this 
module, the user chooses either the Parker Spiral or 
Heliospheric Current Sheet option. The Parker 
Spiral option allows the user to observe how radial 
motion of solar wind plasma can lead to the spiral 
interplanetary field geometry [Parker, 1958], The 
user can use the slider bars to observe how the field 
geometry is affected by varying the solar wind 
velocity (or solar rotation rate) in either the equato- 
rial plane or at a user-selected heliolatitude. The 




266 


Visualization Techniques in Space and Atmospheric Sciences 


UCLA IGPP Space Physics Program : Particle Motion : Dipole Magnetic Field 


Print/Exit Magnetospheres 7^ 


Cold Plasma. RanMne-Hugcmiot ^ Solar Wind V J 


Start Trajectory 


Continue Drawing 


j 


Erase Graphs 


Draw Field Lines 


H * 

He+ 

He++ | 0+ | Electron | H- j 


j Red 

| Blue 

| Green | Magenta | Black | 


Starting Position x (km): 30 
Starting Position y (km): 30 
Starting Position z (km): 0 
Velocity x (km/s): 10 
Velocity y (km/s): 10 
Velocity z (km/s): 10 



First & Last Pitch Ang: |S4.73B 


9 

X Max & Min Position: | .45.02 

|| .17.02 

□ 

Y Max & Min Position: | .30.64 

■ld£JI 

D 

Z Max & Min Position: | .12.64 

“1| -12.84 

□ 

X Max & Min Velocity: | .17.23 

11 -17.28 

□ 

Y Max & Min Velocity: | .17.22 

□ I' 17 - 25 --- 

□ 

Z Max & Min Velocity: | .is.is 

« mum 

m 

F" YZ- PLANE 


Particle Type: |h+ 

Particle Mass(amu): |i .000 

Particle Charge(e): |i.ooo 

Particle Energy(eV): 1 1 „S656e+QQ 

Time(Min:Sec.hndths): 





X position-1: | +29.848 | X positlon-2: j *44.923 
Y position- 1: [ *30.154 j V po$ilion-2: [ +5,848 


X position- 1: | +1B.154 
2 position-1: | -6.1 54 


X position- 2: j +28.615 
2 posllion-2: | -0.923 " 


Y position-1: | +30.769 | V positlon-2: j +17.231 
Z position-1: 1 +0.615 1 2 position-2: j +0.615 


To Start A Particle Trajectory Press The “Start Trajectory" Button. 

Use Slider Bars To Modify Input Values and Buttons To Control Graphs, Or Select A Help Menu Option For More Info. 


Figure 2. Display from the particle motion module of the space physics program. The particle-tracing exercise in a dipole field 
is shown. The boxes in the upper right update as the particle moves. The boxes on the bottom show positions measured by the 
user using the mouse and cursor. 


garden hose angle and field strength and compo- 
nents also are displayed as a function of heliocentric 
distance. Planet locations are indicated on those 
displays to give the user a sense of the radial 
evolution of solar wind properties. Figure 3 illus- 
trates the screen for the three-dimensional Parker 
spiral choice. 

The Heliospheric Current Sheet option allows the 
user to “design” a heliospheric current sheet shape by 
combining magnetic axis tilt with the introduction of a 
quadrupolar contribution to the solar dipole magnetic 
field. The three-dimensional interplanetary current 
sheet shape is then computed and displayed at a user- 
selected perspective. Solar wind velocity can be varied 
by use of a slider bar. The displays together illustrate 
how the “ballerina skirt” model of the heliospheric 
current sheet is controlled by the shape of the neutral 
line at the solar “source surface.” The associated 


Stream Interaction option allows the user to impose a 
specific heliomagnetic latitude profile of solar wind 
velocity. It directs the program to compute an approxi- 
mate model [Hakamadaand Akasofu, 1982] ofthe 
distortion ofthe equatorial interplanetary magnetic 
field, given that velocity profile and the shape of a 
user-specified current sheet at the source surface. 
Associated model “time series” of solar wind proper- 
ties at various heliocentric distances also can be 
displayed. 

3.4 Cold Plasma Waves 

The cold plasma waves module was designed to 
introduce students to electromagnetic waves in a 
cold plasma [Stix, 1 962] . On entering the module, 
the user is presented with a menu of wave proper- 
ties: Index of Refraction, Dispersion Relation, 
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Figure 3. Display from the Parker Spiral option of the solar wind module. Slider bars allow the user to change the velocity of 
the solar wind, the solar rotation rate, and the heliolatitude. The maximum radius slider bar allows the user to select the range 
covered by the bottom two panels. 


Phase Velocity , Group Velocity , Parallel Group 
Velocity, Perpendicular Group Velocity , Ellipticity, 
and Wavelength. The user can then choose to 
display how a particular wave property varies as a 
function of either the frequency or the propagation 
angle (the angle between the wave vector and the 
background magnetic field). 

Figure 4 shows an example of a “Phase Veloc- 
ity” option screen. Slider bars are provided in the 
screen display that allow the user to select the 
background magnetic field strength and the plasma 
density. To view how the wave property varies as a 
function of these parameters, the user can select 
new values by moving the slider bars and pressing 
the “Draw Graph” button. Different colors are used 
to represent different wave modes (left-handed or 
right-handed). If the “NORMALIZED” button is 
chosen, the frequency is normalized by the proton 


cyclotron frequency, the velocity is normalized by 
the Alfven velocity, and the wave vector is normal- 
ized by the inverse of the ion inertial length (the 
velocity of light divided by the ion plasma fre- 
quency in radians per second). 

To view how the wave property varies as a 
function of a variable, such as propagation angle, the 
user can select a frequency by either clicking the 
mouse button in the lower-right box or manually 
typing in the upper-left box, and then pressing the 
“Draw Graph” button. The lower-left box displays 
the results in a polar plot with the background 
magnetic field in the vertical direction. The user 
also can make a single-point measurement for any 
desired frequency, propagation angle, and plasma 
conditions. The wave properties displayed in the 
upper-right box correspond to the parameters in the 
upper-left box. 
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Figure 4. Display from the Phase Velocity option of the cold plasma module. The panel on the left shows a polar diagram of the 
phase velocity of left-handed and right-handed waves as a function of the angle of the wave normal to the magnetic field. The right- 
hand panel shows the phase velocity at fixed angle of propagation as specified by the slider bar versus frequency, here normalized 
by the proton gyrofrequency. 


3.5 Collisionless Shocks: Rankine-Hugoniot Relations 

The Rankine-Hugoniot module was designed to 
illustrate how the properties of a plasma, such as 
density, temperature, and magnetic field, change 
across collisionless shocks. The Rankine-Hugoniot 
conservation relations, which are incorporated into 
this module, allow for predictions of the properties 
of the downstream plasmas to be made based on 
knowledge of the strength of the shock and up- 
stream conditions [TidmanandKrall, 1971], In the 
Graphs section of this module, illustrated by Figure 
5, the jump in number density, magnetic field 
strength, temperature, and plasma beta can be 


calculated as a function of one of the controlling 
upstream parameters. One can vary the Mach 
number that measures the strength of the shock, the 
plasma beta that measures the ratio of the thermal to 
magnetic pressure, the angle between the upstream 
field and the shock normal, and the polytropic index 
in the ideal gas low. By selecting None, the user 
can find discrete values for the jumps in the plasma 
quantities for a given set of upstream parameters. 
The Case Studies section of this module allows the 
user to enter dimensional quantities for the plasma, 
such as velocity, number density, magnetic field 
strength, temperature, and so on, and obtain the 
downstream values for these quantities . 
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Figure 5. Display from the Graphs option of the Rankine-Hugoniot module. Here, the user has selected to display the variation 
of shock jump parameters versus Mach number. 


4. OTHER SOFTWARE USED IN UCLA 

SPACE PHYSICS COURSES 

Interactive, menu-driven graphics software is 
used both as a tool for graduate students in their 
dissertation research and also in computer labora- 
tory exercises to introduce students to the physical 
processes and phenomena in space plasma physics. 
These software tools allow immediate access and 
display of the data and facilitate the application of 
standard analyses to the data, such as minimum 
variance analysis; Fourier analysis, and filtering 
[Russell, 1983], 

Modern commercial mathematical packages are 
now available that facilitate the manipulation of 
mathematical expressions, the integration of func- 
tions, the solution of differential equations, and the 
display of the results of these manipulations. At 
UCLA, we have used Maple (Waterloo Maple 
Software, info @ maplesoft.on.ca) in computer 
laboratory exercises and find that students often 


extend their use of this software beyond the class- 
room setting. 

5. CONCLUDING REMARKS 

From our experience at UCLA, interactive, 
menu-driven graphics software is a good way to 
introduce students to the physical processes occur- 
ring in space plasmas. We have developed modules 
for magnetospheres, particle motion, cold plasma, 
solar wind, and collisionless shocks. These mod- 
ules need extension, and we need more modules to 
provide a more complete curriculum. They are now 
being used in both upper division and graduate 
classes at UCLA and have recently been introduced 
at the University of California at San Diego. The 
modules have been best received in computer 
laboratory situations where the instructor is avail- 
able to answer questions. Remote dial-in usage has 
been less successful principally because of interfac- 
ing problems with X-Windows emulators. Gradu- 
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ate students prefer remote dial-in capability because 
it allows them the freedom to arrange their sched- 
ules, but such freedom comes at the price of de- 
creased interaction with the instructor. 

We welcome other users. We also welcome 
new ideas for modules, and, especially, we wel- 
come assistance in developing modules. Neverthe- 
less, despite our success to date, some problems 
remain. First, developing software is expensive. 
Second, because graduate students and outside 
users want to run software on a variety of plat- 
forms, portability is critical. Fortunately, current 
developments in computer software and operating 
systems may assist in mitigating, if not completely 
solving, these two problems in the coming years. 
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The National Geophysical Data Center has the largest 
collection of geomagnetic data from the worldwide 
network of magnetic observatories. The data base 
management system and retrieval/display software have 
been developed for the archived geomagnetic data 
(annual means, monthly, daily, hourly, and 1 -minute 
values) and placed on the center’s CD-ROMs to pro- 
vide users with “user-oriented" and “user-friendly” 
support. This system is described in this paper with a 
brief outline of provided options. 
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The National Geophysical Data Center 
(NGDC) has collected geomagnetic data from the 
worldwide network of magnetic observatories for 
most of the last four decades. NGDC has 
accumulated the world’ s largest collection of 
magnetograms on microfilm and microfiche, 
publications, hourly mean value tables, and digital 
geomagnetic data on various media, including 
magnetic tapes, optical erasable discs, and CD- 
ROMs. The NGDC CD-ROMs with solar- 
terrestrial physics data were issued in 1987 and 
1990. The database management system and 
retrieve-and-display software have been developed 
for these CD-ROMs, including an access to all 
kinds of the archived geomagnetic data (annual 
means, monthly, daily, hourly, and 1 -minute 
values) to provide users with “user-oriented” and 
“user-friendly” support. The current version of 
software is oriented to support the hourly and 1 - 
minute data bases, and it provides daily, monthly, 
and annual averages from the hourly mean values 
(Figure 1). 

The “hourly ” geomagnetic software does the 
following: 

• Displays the geomagnetic observatory 
catalog [Abston et al., 1985] — observatory 
name, three-letter observatory code, two- 
letter country code, and geographic coordi- 
nates (Figure 2) 

• Displays the monthly data availability 
information table — that is shows the years 
and months of records of a given observa- 
tory that are available in the hourly means 
digital database, such as CD-ROM (Fig- 
ure 3) 

• Creates a subset of selected data in a flat 
ASCII file (Figure 4) and provides an 
option to sort the data from the standard 
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“ 1 month/one component” sequence to the 
“1 day/all components” sequence, which- 
ever is more convenient for scientific 

analysis 

» Displays the hourly data file day-by-day or 

provides multi-day graphics (up to 3 1 
days) — the latter option provides a monthly 
“stack-plot” magnetogram of all compo- 
nents for a given month (Figure 5), and 3 1 , 
30, 29, or 28 days will be shown for the 
corresponding months 

• Allows changes to the graphic and back- 
ground colors for the convenience of the 
output to a color printer, shows an average 
value of each component for the selected 
interval, allows changes to the scale factor 
of the components, and views the file 
forwards and from the beginning 

• Retrieves the hourly mean values from the 
original file to compute the daily and 
monthly mean values (in the separate flat 
ASCII files) and displays daily values for 
the entire year to show the annual variation 
of the geomagnetic field (Figure 6) — this 
option helps carry out quality control of 
geomagnetic data and conduct the scientific 
analysis of secular variation of the geomag- 
netic field 

The “1 -minute” geomagnetic software retrieves 
and creates a subset of selected data in a flat ASCII 
file and shows the data in a 24-hour group (Fig- 
ure 7). The graphic mode provides the same visual- 
ization service as described above for the hourly 
mean values. 

The retrieval/display software has been devel- 
oped as a set of independent executable modules, 
which can be easily replaced by new ones. There- 
fore, it is easy to add new options for the data 
visualization service. The Geomagnetic Data Base 
Management System consists of the following 
independent modules : 

3 Main data base management system pro- 

gram (run first) 

• Hourly value retrieval program 

• Module to sort hourly value data in the 
sequence “all components for 1-day” 

• “Hourly averaged” daily variation graphic 
program (displays data from 1 through 3 1 
days on the screen) 


• “Daily averaged” annual variation graphic 
program (daily mean values are read from 
the hourly value data file) 

• One-minute value retrieval program 

• “One-minute averaged” daily variation 
graphic program 

» Documentation file 

All programs are written in MS-DOS C lan- 
guage (Version 6.00A) and may be run on the 
widely used 286, 386, or 486 personal computers. 
Retrieval modules create intermediate files INS and 
INSM in the current directory, which contains the 
names of the created output files (subsets of the 
hourly value and 1-minute value data, respectively). 
The graphic visualization modules use these files to 
display the hourly value and 1 -minute value daily 
or annual variations. These graphic modules can be 
used separately from the data base if the names of 
displayed data files are placed in the INS or INSM 
files by the standard text editor. File LCOLOR is 
created from the first run of any graphic program. 
The file contains the number “1” to plot the color 
graphics over the black background of the screen. 
By changing this number to “2,” the color graphics 
will be plotted over the white background; “0” 
displays the white graphics on the black back- 
ground. This option is a menu choice, but the 
numbers also can be placed in the LCOLOR file by 
the standard text editor. 

The hourly values must be written in the IAGA 
format — 1 20 ASCII characters per logical record, 
or 62 binary bytes of the NGDC-05 CD-ROM 
format. The “hourly” software is used for the 
current management of the NGDC digital hourly 
value geomagnetic database on CD-ROMs and 
optical-erasable discs. The “1 -minute” data must be 
written in the WDC-A format — 400 ASCII charac- 
ters per logical record. 

The “1 -minute” software has been selected as a 
primary software set for future NGDC geomagnetic 
CD-ROMs, which will contain 1 -minute 
geomagnetic data collected in the framework of 
STEP Project 6.4, “Global Geomagnetic Database 
for STEP.” The first NGDC/STEP CD-ROMs could 
contain 1 -minute geomagnetic data for 1990-1991 
from up to 50 to 60 magnetic observatories 
worldwide [Papitashvili et al., 1993]. Eventually, a 
set of CD-ROMs will cover the entire STEP period, 
1990 to 1997. This effort is consistent with the 
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approach to bring the “Personal Data Center” on 
CD-ROMs (i.e., the World Data Center products) to 
scientists in any comer of the world. The NGDC 
CD-ROMs and accompanying software are 
available from the National Geophysical Data 
Center (Boulder, CO) at a minimum cost to cover 
only shipping and handling. 


REFERENCES 

Abston, C.C., N.E. Papitashvili, and V.O. Papitashvili, 
Combined International Catalog of Geomagnetic 
Data, Report UAG92, 291 pp.,NOAA/National 
Geophysical DataCenter, Boulder, CO, 1985. 

Papitashvili, V., M. Teague, E. Friis-Christensen, and J. 
Allen, STEP Project 6.4 Plans to Establish One- 
Minute Geomagnetic Database, Part 2, STEP Interna- 
tional, 3, 1,6-9, 1993. 


WELCOME TO THE GEOMAGNETIC DATABASE 
Version 4.0 MANAGEMENT SYSTEM February 1993 

HOURLY VALUES DATA BASE: 

Retrieval and File Output Mode 

Sort Mode of Selected Data_File: One Day-All Components 
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Figure 1. Geomagnetic Data Base Management System main menu. 



Figure 2. Listing of the observatory names and three-letter codes, two-letter country codes, and geographic coordinates. 
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WELCOME TO THE HOURLY VALUES DATABASE 
RETRIEVAL SYSTEM 
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Figure 3. Monthly data availability information table. 
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Figure 4. Time interval selection menu. 
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Figure 6. Daily averaged annual geomagnetic field variation — Boulder magnetic observatory, 1989. 
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Figure 7. One-minute geomagnetic data display — Boulder magnetic observatory, October 21 , 1989. 
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The management of large geophysical and celestial 
data bases is now, more than ever, the most critical 
path to timely data analysis. With today ’s large volume 
data sets from multiple satellite missions, analysts face 
the task of defining useful data bases from which data 
and metadata (information about data) can be ex- 
tracted readily in a meaningful way. Visualization, 
following an object-oriented design, is afundamental 
method of organizing and handling data. Humans, by 
nature, easily accept pictorial representations of data. 
Therefore graphically oriented user interfaces are 
appealing, as long as they remain simple to produce 
and use. The Visual Interface for Space andTerrestrial 
Analysis (VISTA ) system, currently underdevelopment 
at the Naval Research Laboratory ’s Backgrounds Data 
Center (BDC), has been designed with these goals in 
mind. Its graphical user interface (GUI) allows the 
user to perform queries, visualization, and analysis of 
atmospheric and celestial backgrounds data. 


1. INTRODUCTION 

1 . 1 The Data Center Challenge 

Geophysical and astronomical data centers 
throughout the world are currently facing the 
challenge of how to properly manage large-volume 
data from multiple satellite missions in an efficient 
manner. With the advent of remote sensing instru- 
ments producing gigabytes of data a day, both 
computer scientists and analysts are left with the 
realization that data collection technology is 
outpacing data archival and access technology. 
Scientists, programmers, and data base analysts are 
therefore presented with the problem of defining 
more useful scientific databases from which one 
can readily extract data and metadata (information 
about data) in a meaningful way. 

At the Backgrounds Data Center (BDC), one of 
the main charters is to catalog, access, and view 
mission information pertinent to each experiment 
data base. BDC is responsible for handling DoD 
and non-DoD remote sensing terrestrial and celes- 
tial backgrounds data. A majority of the software 
development time at the data center involves the 
design and implementation of on-line catalog 
systems. The initial catalogs have been built using a 
relational data base management system (RDBMS) 
called INGRES®. Relational tables containing 
critical experiment metadata are created and ma- 
nipulated through the use of standard Structured 
Query Language (SQL) commands using a forms- 
based user interface (FBUI). 

An example of an online catalog is the BDC 
Summary Catalog, which enables a user to build 
temporal, spatial, and spectral (TSS) queries, run 
them against all available INGRES tables, and view 
the resultant items in a list box. A user works 


280 


Visualization Techniques in Space and Atmospheric Sciences 


through a series of text windows, using the key- 
board and specific function keys to build the query 
requirements. The idea at the summary catalog 
level is to help a user properly evaluate the data sets 
that match certain criteria and to even perform a 
basic cross correlation of results. The Summary 
Catalog, however, only addresses the metadata of 
these experiment data bases, not the actual data. It 
contains mission-summary-level information, such 
as platform instrumentation specifications, space- 
craft positional information, and observation events 
for each BDC mission. Data are not directly acces- 
sible. However, information relative to the actual 
data is provided, including a reference to the 
physical archive so a user can place an order for the 
actual data items. 

For more specific information about experiment 
data, BDC created detailed index catalogs for 
certain missions. These catalogs, called Program 
Catalogs, enable a user to access lower level 
metadata (data item specific) using a series of FBUI 
text windows in a fashion similar to the Summary 
Catalog. However, just like the Summary Catalog, 
the Program Catalogs only reveal detailed metadata 
results from data, not the actual data. These cata- 
logs are sufficient for certain purposes, such as 
miss ion correlation and analysis of indexed results 
versus time. However, if a user wants to view 
metadata information and resultant parameters (e.g., 
graphs, line plots, and graphic windows) or work 
with the actual data, other tools must be used. 

1.2 The VISTA System 

B DC began developing a tool to provide 
visualization of satellite viewing scenarios. The tool 
grew into a more sophisticated system that not only 
provided a graphical view, but used the graphics to 
actually build, run, and view query results from the 
catalogs. The Visual Interface for Space and 
Terrestrial Analysis (VISTA) took a step beyond 
the FBUI methods of catalog access by including 
visualization and direct data access. VISTA pro- 
vides a visual query tool for access to the BDC data 
bases. This, in itself, is a distinct advantage over the 
more traditional FBUI catalogs, but the functional- 
ity of VISTA did not end there. VISTA also added 
the ability to directly access data as selected from 
metadata lists within the same interface. Hence, 


VISTA achieved the goal of being a fully integrated 
catalog system — that is, able to access and visualize 
both metadata and data. 

Visualization, using an object-oriented design, 
is a fundamental way of organizing and handling 
metadata and data. Humans, by nature, assimilate 
data more rapidly by visual means than any other. 
VISTA has been designed to meet the challenge of 
bringing graphical catalog access and data visual- 
ization to users. This design has resulted in a 
graphical system pertinent to geophysical and 
celestial backgrounds data in which direct visual- 
ization and information about the data are put in the 
context of how they were obtained. In addition, the 
VISTA system has also met the challenge of how to 
efficiently manage the metadata and data files 
themselves. The VISTA design, interface and 
method of data base management is discussed in 
this paper. For additional background material on 
VISTA and BDC, refer to Dombrowski et al. 
(1992a, 1992b, 1993, 1994) and Snyder et al. (1991, 
1993). 

2. VISTA DESIGN 

2.1 Software Design 

The VISTA prototype system was developed 
under the XI 1 Window System (OSF/Motif) using 
the C programming language on a UNIX platform. 
The majority of the VISTA windows are strictly X- 
Windows. However, because the prototype devel- 
opment was initiated on Silicon Graphic, Inc. 

(SGI), workstations, the current interface also 
makes use of GL windows for some of the two- and 
three-dimensional interactive windows. 

The system architecture consists of several 
major software layers. The top layer, User Inter- 
face, makes full use of the XI 1 Window System 
using the Motif window manager and C. SGI’s GL 
is also used, as mentioned above, but these win- 
dows are encapsulated within X-Windows frames. 
The User Interface is described in greater detail in 
section 3 . The second layer is Session Management, 
which handles all sessions parameters such as user 
identification, user area access, metadata file 
location, and pointers to system files. The third 
layer. Access, handles two main object types: User 
Management Objects and Query Objects. The 
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Management Objects consist of files created as a 
result of a user session. The Query Objects are 
parameters that are built and saved as part of the 
user query. The bottom layer. Data Management, is 
split into two separate systems: an RDBMS and a 
data structure flat file management system. Both 
systems use the same top three levels. However, 
they are both uniquely tied into the VISTA inter- 
face system. 

2.2 Data Management Systems 

The VISTA system has been developed with 
two optional data base management systems : an 
SQL/INGRES-based RDBMS and a data structure 
flat file management system. The latter is the 
portable version that has been the focus of the 
prototype system. The former is one that works 
more specifically for BDC RDBMS catalog tables, 
but it also can be used to support similar structures 
elsewhere. In fact, it is the Catalog Link section of 
VISTA (see section 3.2) that should expand the use 
of this version because a multitude of useful data 
bases now exist under an RDBMS. Therefore, at 
other data analysis centers, VISTA could be ex- 
tended to support various data base management 
systems and the information they hold. 


The portable version of VISTA currently works 
with data structures that make use of metadata fiat 
files and directories containing the actual data 
online. Part of the design of the system allows the 
use of routines inherent in the INGRES-dependent 
version in retrieving and building the metadata flat 
files for the portable version. Therefore, additional 
mission information fed into the INGRES catalog 
systems at BDC or elsewhere can be queried and 
used to populate VISTA flat files for the portable 
version. 

3. VISTA PROTOTYPE INTERFACE 
3.1 The Main Window 

The VISTA Main Window (see Figure 1) is the 
focal point of the system. The window displays four 
main user access areas: Catalog Link, Build Query 
(and Run Query), Edit/View, and Analyze. Each 
area is color coded to help the user move through 
the interface more rapidly. Multiple windows 
opened in an area are easily recognized by the 
color-coded bars at the top of the window. These 
four main VISTA areas are discussed in separate 
sub-headings in a fashion parallel to how a user 
would activate the interface. 



Figure 1. VISTA Main Window. 
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A fifth area of the Main Window associated 
with the four main areas is the VISTA Workspace. 
This display window is the area where users can 
save particular objects (see Figure 1) as produced 
from one of the main areas. For instance, a user in 
Build Query may want to save the parameters 
created in a TSS query. Within Build Query, there 
are options to allow the user to select this save 
mechanism. A specific color-coded object corre- 
sponding to the saved query is placed in the VISTA 
Workspace area. This object contains all saved 
query parameters and is tagged with a specific 
filename for retrieval and use within the interface. 
Each object has an associated pull-down menu that 
allows the user to activate, rename, copy, or delete 
the object at any time. The VISTA Workspace area 
is similar to PC and workstation environments that 
rely on visual icons for files and executables. By 
using the Workspace, a user can enter and exit the 
VISTA system at any time while preserving the 
information created within a particular area of the 
Main Window. 

3.2 Linking to the Science Catalog 

The first button on the left of the Main Window 
is Catalog Link. In the prototype, this button is 
inactive, hence grayed-out. In future versions, this 
button will provide access to other catalog inter- 
faces and be a mechanism for the retrieval of 
pertinent metadata from sources other than the 
VISTA internal data base. For instance, by pressing 
the Catalog Link button, users will attain direct 
entry into the BDC Summary and Program Catalog 
interfaces and the BDC Science Catalog Informa- 
tion Exchange System (SCIES), and they have 
direct access to Internet, FTP services, and 
hypertext tools such as Mosaic®, which was created 
by the National Center for Supercomputing Appli- 
cations (NCSA). 

3.3 Building Queries 

The second button from the left, Build Query, 
provides direct access to a series of windows 
allowing a user to specify query parameters to run 
against various mission sources (data bases). When 
a user activates the Build Query button, the Build 
Query control panel appears (see Figure 2). This 


panel consists of three main areas: Source List, 
Query Parameters, and Run Query. The Source List 
area consists of a box presenting all available data 
sets from which to query against. The list presents 
the names of the missions supported at BDC. From 
the list, a user can select either an entire mission or 
sub-select a particular instrument or observation, 
and so on, depending on the most logical character- 
ization of the mission parameters. A user can also 
select multiple missions or portions of mission data 
for cross-correlation purposes. This is the real 
power of multiple or sub-sampling source selection 
list boxes, allowing a user to perform various cross- 
correlational studies. 

The Query Parameters section of this control 
panel is where a user builds the detailed TSS query 
specifications using both listboxes and activation 
buttons. Queries are built by either directly typing 
them in or by using graphical windows to delineate 
specific query parameters. For each TSS specifica- 
tion, there is an associated window activation 
button. For example, for spatial specifications, a 
user activates either Geographical (Earth observing 
systems) or Celestial (space observing systems). 
These graphical windows allow the user to specify 
multiple spatial queries. The Geographical Query 
window (as shown in Figure 3) allows a user to 
specify Earth-based spatial ranges. The Geographi- 
cal window provides a map view of the Earth and 
allows the user to move around, using slider bars, in 
latitude and longitude. 

V arious control buttons in the Geographical 
Query window enhance the area selection process. 
For instance, the user has the ability to zoom, 
change the map selection, or create a query area. 
When a user wants to create one or more areas in 
latitude and longitude, the Create Area button is 
selected. Input parameters from the geographical 
map view are saved in the Geo Area located below 
the map view (see Figure 3). If a user wants to 
specify an altitude range for the data query, the 
altitude bar to the right of the geographical map 
view is repositioned. This bar mechanism allows 
the user to set a low and high range in the 
atmosphere and to associate it with the Geo Area in 
latitude and longitude. By using this graphical tool, 
a user builds direct spatial queries for Earth-based 
observations from remote sensing platforms. When 
finished building queries, a user presses the “OK” 
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Figure 3. Geographical Query window. 
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button at the bottom of the Geographical window to 
update the Build Query control panel. V alues can be 
changed in either the Geographical section of the Build 
Query control panel or the graphical windows. 

The Query Parameters section of the control panel 
also allows a user to save the query parameters as new 
Workspace objects or to clear the control panel and 
bring in pre-saved Build Query objects. A user can 
create various combinations of query parameters and 
sources to run multiple queries against the catalog data 
bases. Allowing users to create multiple queries 
demonstrates the power of a graphical-based user 
interface, which makes data base query building faster 
and more efficient. 

3.4 Running Queries 

The B uild Query control panel contains the 
windows for building queries and the Run Query 
button, which activates the query process. The VISTA 
prototype was initially built relying on an RDBMS 
query mechanism, making it more practical for BDC 
use because there is an INGRES catalog dependency. 
When this version is run, SQL commands are built and 
sent to query against the INGRES tables containing the 
appropriate VISTA metadata. At BDC, a user running 
the INGRES version of VISTA on a workstation 
would use the interface to build and run the query, and 
when he or she activates the run query, the session 
management layer of VISTA takes the SQL com- 
mands and sends them over the network to back-end 
main frame machines that host the INGRES tables. 

The information retrieved from the tables is then 
passed back to the individual workstation and VISTA, 
which stores the incoming metadata values into 
appropriate dynamic variables. 

In addition to the INGRES-dependent version, 
VISTA also offers a completely portable non- 
IN GRES vers ion for the data base management 
system. With this system, users are only dependent 
on their workstation platform to run VISTA. In this 
case, when a user hits the Run Query button, a 
completely internal file management system is used 
to provide query results to the system. Currently 
this is the more mature system under development 
because it is this version that will be distributed for 
review and use. This version uses data structures to 
directly search indexed metadata flat files as 
opposed to relational tables. 


3.5 Editing and Viewing Query Results 

When a user presses the Run Query button in 
the Build Query control panel, the Edit/View area is 
automatically activated. The Edit/View control 
panel (see Figure 4) is the area that gives the user 
the first view of the metadata results. This panel is 
divided into two main sections: Data List (query 
results list box) and View List (selected data items 
for viewing). The Data list provides the user with a 
column-formatted table that displays the data items 
matched by the query and their associated key 
metadata values. A typical line in this list would 
display the associated source, the item catalog 
identification number, and key physical parameters 
such as positional and temporal information. Users 
can select individual items, groups of items, or all 
of the Data List return values to be put into the 
View List. Mechanisms to save and print the 
resulting lists are a planned VISTA enhancement. 

The View List is specifically created for direct 
visualization of metadata items. The View List 
works with a series of graphical visualization 
windows specifically created for geographical- and 
celestial-based data. At the bottom of the Edit/View 
control panel are five window buttons: Map View, 
Vista View, Celestial, Line-of-Sight, and 
QuickLook. The Map View window is similar to 
the Geographical Query window. It provides a map 
view of the Earth for Earth-based observations, 
along with multiple option buttons for viewing the 
selected metadata items. This window gives the 
user a two-dimensional projection of the observing 
platform over the Earth, a projection of the FOV 
corner points, the line-of-sight vector, and the area 
covered in the atmosphere or on the Earth’s surface 
as a geometric projection. 

The Vista View window (see Figure 5) provides 
a similar view, but as a three-dimensional interactive 
window that allows the user to move around the 
Earth as if above the Earth in a “meta-observer” 
position. Movements include panning, zooming, 
changing latitude, longitude, and altitude, or chang- 
ing the look point from the satellite’s position. A 
position of the satellite is plotted, as well as similar 
information found in the Map View window. The 
user also has the option to change coordinate system 
plots or turn Earth lighting models on and off. The 
Vista View clearly offers some of the more powerful 
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tools for data set evaluation. A third window, 
directly tied to the previous two, is the Line-of- 
Sight window. This graphic is similar to the Vista 
View in that it gives you a three-dimensional view; 
however, this window was designed to give the user 
the sensor eye perspective of an observational 
event. The window reflects the actual position of 
the instrument above the Earth’ s surface and shows 
the view from the FOV dimensions. 

The QuickLook and Celestial viewers are indepen- 
dent from the other three buttons (not available for 
prototype release) . The QuickLook window will 
provide a representative item from each data set, such 
as an image, or spectra, or even line data, depending on 
the type of instrument used. The Celestial window will 
display a celestial background map using a celestial 
catalog of choice, and it will overlay the areas ob- 
served. Celestial object information will be available, 
such as location, magnitude, and object type for stars 
and extended objects. 

All of the Edit/View windows have been 
designed to help users make decisions about certain 
data results by viewing both the metadata lists and 
the visualization at the same time. At any time, the 
user may save the Edit/View list as an object to 
reside in the Workspace area. A user may also save 
portions from these lists by using the Build Analy- 
sis Sets button to build analysis sets that contain 
pointers to the actual data. 


3.6 Analyzing Data 

After refining data lists in the Edit/View section of 
the interface, a user may want to make the next natural 
progression — that is, work with the actual data. The 
analysis sets that are built from the metadata in the 
Edit/View section are the vehicle for image processing 
and analysis within the system. Each analysis set, 
based on the list of selected data items, contains the 
necessary information for a user to pull up online data 
and to do this within an analysis tool without leaving 
the VISTA system. 

When a user presses the Analyze button in the 
Main Window, the Analysis control panel (see 
Figure 6) is activated. This window lists the data 
items that were chosen for the particular analysis set 
and provides optional selections for the user, such as 
image processing tools, analysis packages, and 
various data modeling tools. The Analyze section of 
VISTA has been designed to be modular so each 
user can implement a set of favorite tools or incorpo- 
rate user-developed tools into the VISTA frame- 
work. Standard proprietary data processing tools 
such as IDL®, IRAF®, SAO Image®, PV-Wave®, 
AVS®, Iris Explorer®, or Data Explorer® can have 
direct links built into the VISTA Analysis section. 

At BDC, direct links for several of these packages 
will be implemented. Sessions can be started and 
data loaded from within the Analysis control panel, 
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making some of the often cumbersome initialization 
steps and data loading processes transparent to the 
user. Even better, non-proprietary tools such as 
Khoros (©Khoral Research, Inc., 1994) or NCSA 
routines can be packaged with VISTA and distrib- 
uted to users. In fact, the VISTA prototype uses 
Khoros as its main image processing and analysis 
package. Khoros was initially developed at the 
University of New Mexico as a DoD project and 
since has become an open software package that 
users can get via Internet. Because BDC is a 
member of the Khoros consortium, Khoros pro- 
grams can be developed by the VISTA team and 
distributed along with the package to users. 

The VISTA prototype works with defined data 
processing pipelines that involve the Khoros tool, 
Cantata, which is a visual programming environ- 
ment tool. This system presents a user with a 
workspace from which program pipelines can be 
built using available Cantata routines or user- 
developed routines hosted by Cantata. Cantata 
displays various image and data processing routines 
as glyphs (see Figure 7) that can be connected 
together to form pipelines. When run, each glyph is 
activated and passes on I/O to the next stage in the 
pipeline. VISTA uses a basic pipeline for data 


display as its default Cantata pipeline. That is, when 
a user brings in a particular analysis set through the 
Analyze area and activates Cantata with the default 
pipeline, Cantata is automatically initiated and the 
pipeline and data selection loaded. At this point, the 
user is within the Cantata system without leaving 
VISTA. To run the pipeline, a user only has to click 
the run button on the left panel of Cantata. Figure 7 
shows the Cantata interface, as brought up using 
VISTA, and shows the output of image display data 
for a particular analysis set. 

VISTA also has been designed to enable users 
to make use of existing workstation tools (i.e., 
Image Tools or XV) or user-defined tools to per- 
form screen captures (RGB, TIFF, Postscript, etc.) 
of any combination of VISTA and analysis plot 
windows. These screen captures can then be used to 
create hard-copy output or animation sequences for 
videos or workstation movie routines. 

4. FUTURE ADAPTATIONS AND 

ENHANCEMENTS 

The VISTA prototype was released as a beta 
version in October 1 993. This elicited positive 
responses from the beta reviewers and provided the 


cm I Cantata 

|| q — —I Cantata, Visual Programming Environment for the KHOROS System — 

115=3 EPIT I PROGRAM UTILITIES 1 INPUT SOURCES | CONVERSIONS | IMAGE PROCESSING [ SIGNAL PROCESSING | 

WORKSPACE | PHOTON UTILITIES | OUTPUT 1 ARITHMETIC | IMAGE ANALYSIS [ REMOTE &GIS | 



HELP 


QUIT 


Efl 


i j f-Mi a 


get_n file 




a a 

input_vista 


H s 

input_vista I 


HE 


countjoop 


h a 


input 



ante 


sub.region 


—| Fits 2AAAa 1B370 



J 1—1 L 


JJHJ 


219 X 219 x = 22805 


I I 


Figure 7 . Cantata. 

















288 

team with a basis for improvements for prototype 
updates. The next prototype version is targeted for 
release at the end of January 1994. While work 
continues on the prototype system, the VISTA team is 
also undertaking a formal object-oriented design, 
analysis, and implementation of a new development 
version using C++. Using a language with object- 
oriented features allows for greater modularity of code 
for both the graphical interface and the data base 
management system. The use of C++ promotes a 
software system more resilient to changes in require- 
ments over time and the availability of application 
objects for other projects in the same domain. 

One of the goals facing the VISTA team is to 
ultimately produce a system that can be run on 
multiple X workstations. This can be achieved by 
designing a pure X/Motif system or by porting the 
GL windows to other platforms by using tools, such 
as Open GL (released by SGI to other vendors, such 
as IBM, DEC, Sun, and HP). These options are 
currently under analysis by the team. 

Enhancements to the prototype system will 
extend its use while the next version is under design 
and development. Some enhancements have already 
been planned, such as the incorporation of higher 
resolution maps for Earth views, additional graphi- 
cal queries as part of the Build Query section that 
are more particular to observational events, the 
addition of mission analysis windows (including 
detailed views of platforms and their orbits), the 
addition of special queries and visualization (such 
as viewing angle from a spacecraft platform), and 
the additi on of graphics and analysis pipelines that 
handle the visualization of multiple types of data. 
These and suggestions from prototype reviewers 
will help the VISTA team drive a more robust data 
access and handling system. 

For more information about the VISTA system 
and its use, please contact Dr. Edmund Dombrowski 
(VIST A Coordinator) at 202-404-783 1 , or send E-mail 
to dombrowski@bdcv8.nrl.navy.mil. All rights and 
privileges to the VISTA name and software are 
copyrighted, 1 99 1 , to the Naval Research Laboratory. 
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PATHFINDER is a software effort to create a flexible, 
modular, collaborative, and distributed environment 
for studying atmospheric, astrophysical, and other 
fluid flows in the evolving networked metacomputer 
environment of the 1990s. It uses existing software, 
such as HDF, DTM, GEMPAK, AVS, SGI Explorer, 
and Inventor to provide the researcher with the ability 
to harness the latest in desktop to teraflop computing. 
Software modules developed during the project are 
available in the public domain via anonymous FTP 
from NCSA. The address is ftp.ncsa.uiuc.edu, and the 
directory is /SGI/PATHFINDER. 

*Currently affiliated with the Department of Marine, Earth, 
and Atmospheric Sciences, North Carolina State University 


1. INTRODUCTION 

During the 1990s, new hardware and software 
tools are providing researchers with enhanced 
capabilities for parallel processing, scientific 
visualization, data management, and distributed 
applications. Hardware/software advances range 
from desktop multimedia personal computers to 
large massively parallel systems. The latter will 
eventually provide teraflop ( 1 0 12 floating point 
operations per second) capabilities that will be 
useful for pursuing answers to “grand challenge” 
and other important scientific questions. For 
example, teraflop capability will enable the study of 
weather changes occurring across the entire United 
States to be carried out with unprecedented resolu- 
tion (1 to 2 km horizontal resolution and 30 vertical 
levels). This is illustrated in Kaufmann and Smarr 
(1992, p. 18), where each data array or lattice (e.g., 
temperature, pressure, and wind) in the simulation 
model has approximately 500 million elements 
(more than 100 times those in use today). These 
arrays will be stored in very large memories with 
maximum configurations well beyond the multiple 
billion bytes available today. In addition, the 
handling and storage of data will be expedited by 
larger disk farms, the use of new storage media 
(e.g., new DD-2 tapes hold between 25 and 165 
gigabytes of data per tape), and high-speed network 
communications (multiple gigabits per second) . 
Interrogation and visualization of this vast amount 
of data will be handled by powerful visualization 
systems that use computer and software resources 
distributed through the user’ s working environment. 

A term commonly invoked to describe the 
user’ s environment consisting of local and remote 
computers connected by high-speed networks is the 
“metacomputer.” Productive use of this environ- 
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ment requires software that allows a researcher to 
distribute and communicate with a collection of 
processes running on different machines and to 
store, retrieve, and share information and data. 
Toward this end, a software effort began several 
years ago at the National Center for Supercomput- 
ing Applications (NCSA) that uses high-perfor- 
mance computing, networking, and data handling 
capabilities. The aim of the project, known as 
NCSA PATHFINDER or simply PATHFINDER 
(Probing ATmospHeric/AsTropHysical Flows in an 
INtegrated and Distributed EnviRonment), is to 
create a flexible, modular, collaborative, and 
distributed environment for studying atmospheric, 
astrophysical, and other fluid flows [Wilhelmson et 
al., 1993]. More specifically, this interactive 
metacomputing environment is being built to: 

• Provide standardized access to and 
centralized control over files, processes, 
hardware, and other resources distributed 
across the metacomputer environment 

• Script, monitor, or interactively control a 
sequence of processes that spans multiple 
machines 

8 Direct a variety of existing applications to 
transparently transfer data between each 
other and remote servers over high-speed 
networks using standardized data formats 

• Manage, access, and relocate large hetero- 
geneous data sets (anything from images to 
gigabyte data files), along with their 
descriptive metadata, in a common direc- 
tory space even though they may physically 
reside on a number of different platforms 

• Browse or search the contents of these data 
sets and their descriptive information and 
user-supplied annotations using an 
interactive multimedia hypertext tool 
(Mosaic) 

• Visualize and compare multiple data 
streams (such as model and observational) 
using a diverse collection of visualization 
software (such as SGI Explorer) and 
hardware (such as video laser disk, virtual 
reality, and high definition display) 

8 Capitalize on existing software capabilities 
whenever possible 

In this paper, some of the components being 
developed to achieve these objectives are described. 


Visualization Techniques in Space and Atmospheric Sciences 

2. GEMVIS 

GEMVIS (GEMPAK Visualization) is an 
outgrowth of the NASA-funded project titled “A 
Distributed Analysis and Visualization System for 
Model and Observational Data.” This software is a 
collection of modules that extends some of the 
capabilities of GEMPAK [desJardins and Petersen, 
1989; desJardins et al., 1991] to a three-dimensional 
graphics environment. GEMPAK (General Meteo- 
rological Package) is a stand-alone meteorological 
software package that has become increasingly 
popular throughout the atmospheric science com- 
munity with distributions to at least 55 institutions, 
including NASA centers, other government re- 
search laboratories and organizations, universities 
and colleges, and industry (available from COSMIC 
and Unidata) . It is used for ingesting and decoding 
weather data. Extensive diagnostic capabilities are 
provided for observational and model data. Map 
transformations are available, along with capabili- 
ties for two-dimensional contour plots, streamlines, 
wind fields, thermodynamic profiles, and isentropic 
cross sections [desJardins and Petersen, 1989]. 

To extend these display capabilities to three 
dimensions and to include animation, portions of 
GEMPAK that use GEMPAK grid files (analysis 
and data transformations) have been encapsulated 
into SGI Explorer Version 2.0 (1991) and AVS 
Version 4.0 ( 1 992) modules. These modules can be 
used with other Explorer and AV S modules for 
three-dimensional viewing and animation. Explorer 
and AVS are two different implementations of a 
general visualization environment in which the user 
“builds” his or her application by connecting 
modules needed for a specific purpose. 

Modules that have been and are being devel- 
oped will (E refers to SGI Explorer modules and A 
to AVS modules): 

• Extract an arbitrary two-dimensional cross 
section vertically (this is based on selecting 
end points of the cross section on a geo- 
graphical map) (A,E) 

• Read GEMPAK grid files and derive all 
GEMPAK diagnostic functions along with 
coordinates (A,E) 

• Derive GEMPAK grid diagnostics and 
coordinates based on data sets other than 
GEMPAK grid files (E) 
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• Draw a topographic map of a given area 
and a three-dimensional topographic 
surface to be registered in any requested 
GEMPAK map projection (A,E) 

• Generate three-dimensional vector displays 
with controls on arrow style, scale, and 
coloring (A, E) 

• Reformulate the basic internal data format to 
include metadata (referred to as a grid) 
permitting the passage of registration, map 
projection, and annotation information, as well 
as provide a missing data indicator ( A,E) 

• Do simple animation of accumulated 
rendered images (E) 


• Display two- and three-dimensional text 
and annotations (E) 

• Provide a number of small utility modules 
to fill in gaps in the functionality of 
existing software (E) 

Only the first four items above are related 
specifically to GEMPAK. The others do not 
require any GEMPAK functions. Furthermore, the 
display modules do not use any GEMPAK display 
routines. No provisions have been made to incorpo- 
rate GEMPAK display routines into the SGI Ex- 
plorer or A VS framework. An example of 
GEMVIS output is shown in Figure 1 . 



Figure 1. A computer screen of a GEMVIS visualization of a potential vorticity surface for a large storm system approaching the 
east coast of the U.S. The two-dimensional image contains the distribution of potential temperature of the storm in the vertical 
plane shown. The topography on the surface is also rendered, and State contours are drawn. 
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3. DISTRIBUTED TOOLS 

It is important to have software communica- 
tions tools for handling communications between 
computers to use today’s distributed computing 
environment. The Data Transfer Mechanism 
(DTM) was developed at NCSA for this purpose. It 
is an application interface that enables inter-process 
communication by supporting message passing 
between distributed processes. Several features 
make it attractive for scientific computing. First, the 
message protocol is designed to be simple and 
efficient when transferring scientific data, such as 
gridded data, raster images, and polygonal data sets. 
Second, data type conversion is provided for these 
classes of messages and has been optimized on 
several supercomputing architectures. These are 
important considerations when dealing with ex- 
tremely large data sets. 

Modules have been implemented that utilize 
DTM with SGI Explorer to retrieve data stored on 
another computer or to pass data from an active 
simulation on one computer to the Explorer envi- 
ronment. Data are either stored or passed in the 
Hierarchical Data Format (HDF), which has been 
adopted as the baseline EOSDIS standard format 
for science and science-related data. Specifically, it 
will be used for standard data product generation 
and for data ingest and distribution. HDF also 
provides access to NetCDF files. Other formats 
require translations, many of which can be handled 
by software developed by other groups. 

One example of the use of these modules was 
demonstrated at the Special Interest Group on 
GRAPHICS (SIGGR APH) meeting in Chicago 
during the summer of 1992. Severe thunderstorm 
phenomena were explored through coupled model 
initiation, simulation, analysis, and display. The 
CRAY Y -MP system at NCSA was connected by a 
T3 ( 1 .54 megabits per second) communications link 
to an SGI VGX 440 on the conference exhibit floor. 
The CRAY Y -MP supercomputer handled certain 
model computations, while the SGI VGX at the 
exhibit managed the user interface for handling 
model Initiation and parameter changes as well as 
two- and three-dimensional graphical displays. 
Using DTM, new model data were analyzed and 
visualized under user control as they became 
available from the executing model simulation. 


Real-time animations were created by capturing 
frames on the SGI as they were produced, storing 
them on laser disk, and then playing them back. An 
example of a screen display is shown in Figure 2 
and another in a recent article by Comerford 
(1992). 

More recently, the CM-2 and the SGI have been 
coupled. DTM is used to transfer data from 
memory on the CM-2 during an ongoing simulation 
to memory on the SGI. For example, PATH- 
FINDER modules have been used to visualize data 
from a density current (or thunderstorm outflow) 
simulation. The display of live model simulations 
using relatively small simulation grids (for real- 
time computations) has been carried out success- 
fully during several video teleconferences held 
between NSF supercomputer centers over dedicated 
T1 video teleconferencing links. 

4. PATHFINDER MODULES 

The modules in PATHFINDER, including those 
referred to above, can be categorized as modules 
used in data acquisition, data mapping, data analy- 
sis, visual idioms, and data presentation. Data 
acquisition modules can read HDF files or commu- 
nicate via DTM with executing flow models. HDF 
files include scientific data, images, and color 
palettes. The HDF scientific data reader incorpo- 
rates a user-friendly user interface with support for 
multiple lattice outputs and vector fields (e.g. , flow 
or vorticity) as well as full n-D support for I/O. 

The HDF module (ReadDF) also has backward 
compatibility with older HDF libraries and provides 
detailed information of file contents (e.g., calcu- 
late min, max). 

Data mapping modules are available to process 
the data. The processing includes arithmetic func- 
tions and arbitrary vertical slicing in a three- 
dimensional domain. The modules for arithmetic 
operations provide examples or building blocks for 
users to develop modules using their own arithmetic 
sequences. The vertical slicing module provides a 
top-view window of the domain. In this separate 
window, contours from a selected horizontal plane 
through the data or from a surface map (e.g., map of 
the U.S. when GEMPAK is used) are used as a 
reference for selecting two points (by clicking on 
two different locations) through which a vertical 
slice is to be displayed. The vertical slice can be an 
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Figure 2. An SGI screen showing a visualization of a growing severe storm along with the associated dataflow network, a list 
of various modules for use informing a network, an interactive window for starting and stopping the simulation on the Cray Y- 
MP and matching variables with SGI Explorer lattices, and a window showing min/max values of the data fields as they are 
computed. The cloud surfaces and horizontal contours are updated automatically as the simulation proceeds. 


image, contours, or a set of vectors. 

The data analysis modules provide numerical 
information about the data. They include commonly 
used functions, such as min/max and trajectory 
calculations. They also include capabilities for 
data analysis available in GEMPAK. 

Additional and improved visual idioms were 
developed for PATHFINDER. A new contour 
module provides the flexibility for selecting contour 
intervals or the number of contour levels. It has 
options for showing or hiding zero contour lines 
and for dimming negative contour lines. The 
contour data range can be fixed or data dependent. 
The former is particularly useful for creating 
contour animations. The new vector module pro- 
vides options for different vector styles (e.g., line, 
line with 2/4 cross-line head, line with cross/solid 
arrow head, and faded line), color mapping based 


on other data fields, and different vector scaling. 
The controls are adjusted according to the range 
of the input data. 

Several annotation modules have also been 
added to the system. The user can annotate a 
vertical axis with tick marks and labels, place color 
maps in the image, and create two-dimensional or 
three-dimensional titles that incorporate metadata. 
This was made possible in the Explorer data flow 
environment by developing a new data type called 
“grid.” It includes the information in the original 
lattice data type, as well as descriptive information 
found, for example, in an HDF file (such as variable 
name, array dimensions, and time of data). 

Particle advection modules have been written to 
generate points in a user-specified subvolume of 
the input data domain and to advect all the points 






294 


Visualization Techniques in Space and Atmospheric Sciences 


in space for the duration of display interval. Both 
time-dependent and independent particle trajecto- 
ries can be computed. 

Data presentation modules have been developed 
for transforming the final image/geometries for use 
with devices other than the original workstation 
graphics window. For example, images can be 
recorded to laser disk and played back on an SGI 
workstation. In addition, data have been fed in a 
boom, a virtual reality device that displays stereo 
images [Sherman, 1992]. The viewpoint changes as 
the boom is moved by the viewer. 

5. ANIMATION TOOL 

Animations can be created and played in 
PATHFINDER, not only using SGI Explorer 
modules but also using a new tool embedded within 
PATHFINDER called the Inventor Animator Tool 
(lATool). Inventor is an object-oriented three- 
dimensional toolkit for rendering and is used in 
providing the basic rendering capabilities in SGI 
Explorer. It utilizes GL commands to produce 
visualizations from three-dimensional objects, such 
as isosurfaces, vectors, contours, and motion 
trajectories. The development of this IATool 
outside the Explorer environment was motivated by 
the desire to streamline the creation of animations 
so that they could be done easily and on a daily 
basis in a distributed computing environment and 
based on very large data arrays/sets. These anima- 
tions involve not only the rendering of objects but 
also layout, choreography, and annotation. Chore- 
ography, including rotation and movement of 
objects and placement of lighting sources, is 
important in investigating and communicating 
important flow features as they evolve over time. 
Annotation of three-dimensional animations is also 
important and has lagged behind that available in 
still images. 

The IATool allows a researcher to assemble a 
detailed and professional-looking animation from 
three-dimensional objects produced by SGI Ex- 
plorer or other visualization tools. For example, a 
researcher might use some Explorer tools (e.g., 
ReadDF or IsosurfaceLat) to produce isosurfaces of 
his data. To get the most information from these 
isosurfaces, they can be animated, be looked at 
from all sides, and/or integrated with other visual- 


ization idioms (e.g., contours or vectors). Using 
IATool, a sequence in which these tasks are accom- 
plished can be quickly constructed and saved for 
further inspection or sharing with a colleague. 

The IATool allows the specification of both the 
scene and the animation of the scene in a highly 
interactive manner from specified geometries. The 
interface is very similar to that used in the standard 
Explorer Render module so it should be immedi- 
ately familiar to Explorer users. Objects are 
manipulated and positioned with the mouse. They 
appear exactly as they will in the final product. The 
position of objects relative to each other or within 
the frame can be edited along with the object’ s size 
and surface properties (e.g., color, shininess, and 
transparency). Components that effect the whole 
scene can also be edited, such as lighting and 
viewpoint (camera position). Furthermore, stereo 
capabilities are also being implemented for display 
on a workstation screen or within an immersible 
virtual reality environment (e.g., the CAVE, a 
technology environment involving high-resolution 
rear- projection on multiple walls of a room [Cruz- 
Neira, 1992]). 

In addition to the above features, any of the 
possible adjustments to the scene can be animated, 
including the parameters that generate the visualiza- 
tions. Animation is specified by setting what are 
called keyframes. Keyframes specify what certain 
“key” frames of the animation should look like and 
are most often used in the final production of an 
animation. The tool uses interpolation methods to 
generate the frames in between the keyframes. A 
keyframe is set at the beginning of an animation 
specifying how everything should appear initially. 
Another keyframe is then set at some later frame 
specifying how things should look then. For 
example, an object might be shown from the front 
in the first frame and then from behind in a later 
frame. The viewing position in between keyframes 
is then automatically interpolated. When all the 
frames are played, the object would appear to 
rotate. More complex animations would involve 
the use of a number of keyframes. 

Animations can be replayed directly from 
within IATool, although the maximum speed is 
limited by the time it takes to render each frame. 

For production of a final product, a request would 
be issued for the tool to render each frame and then 
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automatically pass the frame to a utility program to 
generate SGI or MPEG movie files. Other utility 
programs could send the output to video tape or 
laser disk (with suitable hardware). 

6. THE FUTURE 

The evolving electronic and multimedia 
metacomputing environments of the 1990s will 
provide researchers with new capabilities that will 
need to be harnessed through the integration of 
different computing, communications, data 
handling, and visualization capabilities. In 
addition, improved software for working together 
from remote sites, as well as access to the 
increasing volume of simulation, observational, and 
reference data, is needed. High-quality three- 
dimensional animations will be used routinely and 


shared among the research and educational 
communities. The media for carrying these images 
will range from high-quality color hard copy for 
still images and video tapes for animations to 
multimedia online formats provided through NCSA 
Mosaic* (see Figure 3) and multimedia electronic 
mail that combines text, images, movie files 
(MPEG/Quicktime), and audio. The sharing of 
information will be available to a large audience as 
low-cost workstations begin to appear that have 

*Mosaic is a cross-platform, hypermedia information 
retrieval and team collaboration system encompassing existing 
information servers such as Gopher, WAIS, and Worldwide 
Web in addition to providing new multimedia services. It is 
available from NCSA through anonymous FTP. On the Internet 
use ftp ftp.ncsa.uiuc.edu and logon using anonymous for the 
name and your local electronic address for the password. Then 
enter get README. FIRST and follow directions. Enter quit 
to exit FTP and return to your local host. 



Figure 3. Two pages from a Mosaic document showing examples of severe storm research using PATHFINDER. By clicking on 
the storm outflow simulation (note dashed underline), the first page of results describing the simulation appears. By clicking at 
the bottom of the second page, an animation of the outflows as they collide would be shown. 
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high-quality rendering, stereo, and direct video 
capabilities. NCSA PATHFINDER and other 
efforts will strive to meet the evolving needs of the 

research community. 
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